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We review the theory of inflationary perturbations. Perturbations at both linear and nonlinear 
orders are reviewed. We also review a variety of inflation models, emphasizing their signatures on 
cosmic perturbations. 
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I. AN INTRODUCTION TO INFLATION 

Inflation [IH3] is the leading paradiam of the very early universe cosmology. The quantum fluctuations during 
inflation are stretched to super-Hubble scales and return to the Hubble horizon after inflation. They provide the seeds 
of CMB anisotropy and the large scale structure [3HH] ■ 

Here we review the cosmic perturbation theory, which calculates inflationary fluctuations. There have been already 
many books, reviews and lecture notes in this area [9-22 . In the present review we focus on an explicit treatment 
of the standard theory, including linear and non-linear perturbations. After that, we make a broad survey of models. 
Before those aspects, we briefly overview the inflationary cosmology in the rest of this section. 

A. The hot big bang 

The dynamics of the standard hot big bang cosmology is governed by the Friedmann equation. By assuming the 
universe is homogeneous, isotropic 1 and flat 2 , the metric can be written as the FRW form: 

ds 2 = -dt 2 + a(t) 2 dx 2 . (1) 



1 Homogeneity and isotropy are firstly introduced just to simplify the equations. Later it is supported by observations that our universe 
is indeed nearly homogeneous and isotropic above 100 Mpc scales (and the size of our observable universe is about 3000 Mpc). 

2 Unlike homogeneity or isotropy, flatness is not a statement of symmetry. Nevertheless experiments suggest that our universe is very 
fiat. Non-flatness, if exists, should have a curvature radius at least 10H~ 1 . Thus for algebraic simplicity we also assume flatness unless 
stated otherwise. 
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Here a is the scale factor of the universe. Note that in a fiat universe, the scale factor itself does not have physical 
meaning. Because we can always rescale a and spatial coordinate Xi together. On the other hand, ratios of scale 
factors are important. For example the Hubble parameter H = a/a and the redshift z = a(to)/a(t) — 1 are key 
quantities in cosmology. 

With the FRW metric, the 00-component of the Einstein equation is the Friedmann equation: 

3M 2 H 2 = p , (2) 

where p='Y^ l Pi = p m + Pr + Pa + ■ ■ ■ is the energy density. For non-interacting energy components, the time evolution 
(characterizing how this component dilutes with expansion of the universe) at the homogeneous and isotropic level is 
determined by the continuity equation 

p + 3H(p + p) = , (3) 

where p is pressure, which can be determined by p (and perhaps other parameters) from the equation of state for this 
form of matter. For example, for thermal matter, p/p can be calculated from the thermal partition function. For the 
cosmological constant, p/p = —1 follows from Lorentz invariance. 

Our present universe is matter (p -C p) dominated. At earlier times, the universe was hotter and radiation (p = p/3) 
dominated. However, radiation domination cannot be eternal to the past. This is because from Eq. ([2]), inserting 
p = p r oc a -4 , we get a oc 1/a and a oc \/t — t- u where t{ is an integration constant. The moment t = t{ is a physical 
singularity, where scale factor sinks to zero and physical energy density diverges. Note that t is the proper time 
measured by a comoving observer. Thus the radiation dominated universe must start at some finite physical time 
(137Gyr ago). The start of the radiation dominated universe is known as the "hot big bang". 

The singularity at t = t\ requires the hot big bang. But this singularity by itself does not imply what happens 
before the hot big bang, or even if anything, including spacetime, exists or not before that. Nevertheless, the hot big 
bang cosmology is sensitive to its initial conditions. There are strong evidences (almost firm proof) showing that new 
physics exists before the big bang. And inflation is so far the most successful candidate for this very early epoch. In 
the remainder of the section, we shall focus on those evidences and how inflation explains them. 



B. Horizon, flatness, monopole and correlation problems 

It is a surprising fact that we find ourselves living in a huge universe - about 10 23 kilometers in the observable 
radius and 10 10 years in age (where kilometers and years are human's natural units). This turns into a greater surprise 
when we realize that our universe is supported by fundamental physics, where the natural scales are eV, GeV or even 
Planck scale. In those units the age and size of our universe are about 10 33 eV _1 (~ 10 42 GeV _1 ~ 10 60 Af~ 1 ). 

Those surprises can be formulated in more accurate terms, by the horizon and flatness problems of the standard 
hot big bang cosmology. To see those problems, let us assume for a time that nothing happened before the hot big 
bang. In other words assume the universe is radiation dominated all the way back in the early times. Then problems 
arise: 

• Horizon problem (I): Let us start with an example of the cosmic microwave background (CMB). 

From observations, we actually see the universe is homogeneous and isotropic in the past, because light (or any 
other causal signal) needs time to propagate to us. By optical probes, we can observe the homogeneity and 
isotropy of the universe up to as early as recombination: When the CMB forms, the universe is already very 
homogeneous and isotropic (up to fluctuations of order 10 -5 , which we will discuss soon). 

However, from the hot big bang to the time of recombination, causal signals could only propagate a limited 
distance in space. This is because those signals could at most propagate at speed of light, with null geodesic 
= ds 2 = —dt 2 + a 2 dr 2 . Thus the physical distance of propagation (measured by today's ruler a(to}) from the 
big bang to recombination is at most 3 

a(io)Ar = a(tn) / — t-t — ^VU-ccto ~ \ ~~ x ( srze of current observable universe) . (4) 
Jti °W V *o 



Here we approximated the universe as radiation dominated. Actually the universe has already become matter dominated at z ~ 3000. 
Nevertheless this doesn't make a big difference for this estimate. 
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To put in numbers, the causal signals, from the hot big bang to recombination, could only propagate as long 
as about 1/100 of the radius of our observable universe. Thus different patches of the CMB sky were casually 
disconnected. This situation is illustrated in Fig. [T] 

One could also consider this problem using conformal time. This is because conformal time, together with 
comoving spatial coordinates, straightforwardly shows the causal structure of spacetime. 4 The flat FRW metric 
can be rewritten as 

ds 2 = a(T) 2 (-dT 2 + dx 2 ) , (5) 

where r is defined (up to a constant) by dt = adr. Using conformal time, the light cones are 45 degree straight 
lines in the t-x plane. Thus the distance luminal signals travel equals to conformal time. 

2 

Consider equation of state p = wp, where to is a constant. One finds a oc (t — to) 3 ( 1+ ™i . Thus 

3(1 +uu) , . i±3tg 
t-t = ! > (t-to ^™ . 6 
1 + Sw 

For a fluid with w > —1/3 (including matter and radiation), a very short period of proper time t— to corresponds 
to a very short period of conformal time r — tq. Thus light cannot propagate far during this period of time. 





FIG. 1: The horizon problem of hot big bang cosmology. The space and time in this diagram should be understood as comoving 
and conformal respectively. The several smaller light cones are truncated between the hot big bang and recombination (the 
time of CMB formation). Those light cones have no overlap at the time of hot big bang, thus they are causally unrelated at 
recombination. But why, they at recombination (for example A and B in the figure), look similar to each other (considering 
we can observe many of them, as indicated by the greater light cone between recombination and the present time)? 

Note that in the CMB example of the horizon problem, i rec is set by the scale of observation instead of the 
intrinsic nature of the problem itself. More generally, we could consider different observables (replace t ICC 
with other time scales) and generate a family of similar horizon problems. For example, problems of causally 
disconnect patches of large scale structure. 

• Horizon problem (II): One could alternatively formulate the puzzle of horizon size in another way. Imagine 
we are observers sitting at x = 0, who have existed since the hot big bang, and there exists some structural 
information S which are comoving 5 in the universe (including homogeneity, isotropy and flatness, and departures 
from those ideal conditions). We could observe S by luminal or sub-luminal signals emitted from the structures 
on S. Now at time £ bs (not before or after), how much portion of S could we observe (in comoving size)? 



4 Conformal time is not only important for showing causal structures. There are also a few other important usages of conformal time. 
For example, the field equations looks simpler and closer to Minkowski field equations in conformal time. Also, it can be used to show 
that conformal invariant fields do not "know" the cosmic expansion. We will discuss those aspects more extensively in later sections 

5 The requirement of comoving is not only for simplicity. Motions away from comoving receives Hubble fraction (the kinetic energy loss, 
analog of redshift for a massless particle). Thus on large scales, objects tend to be comoving. 
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We find there is a competition in this situation. On the one hand, as t h a becomes larger, the past light cone is 
larger thus we tend to see more structures on S. On the other hand, the expansion of the universe itself tends 
to hide those structures into more distant regions, where we may not be able to observe. Interestingly, those 
two effects depends on different order of derivatives of a. Thus typically there is a winner between them. The 
comoving size of the observable universe is characterized by the comoving horizon size: 



We find that as we move forward in time, we see more and more portion of S. In other words, comoving 
objects are floating inwards from outside of our horizon 6 . Or from the horizon point of view, the universe is not 
expanding "fast" enough, and thus the horizon keeps expanding in comoving size. (Here we have quoted "fast" , 
because the time dependence of the quantity aH — a is actually the acceleration of expansion, instead of the 
expansion speed.) 

Now comes the horizon problem: If no one could never "see" (more accurately, interact with) those structures 
in the early universe, how could they be causally prepared, and prepared in a consistent way? This situation is 
illustrated in Fig. [2] and Fig. [3] 





FIG. 2: The horizon problem II, or the flatness problem. At early times, the observer could only observe a small portion of the 
universe thus unaware of larger scale structures (e.g. the spherical topology in the figure). At later times though radiation or 
matter domination, the universe expands. However, the observer's horizon expands faster. Thus the observer could obtain more 
large scale structural information (e.g. easier to measure the non-flatness of the universe, if exists). From current observation, 
the universe is rather flat now. If there were non-flatness in our universe, how could causal process produce them in the early 
universe, considering no causal process could see this non-flatness in the early universe? 



• Flatness problem: The flatness problem could be considered as an example of the version (II) of the horizon 
problem. Here the structural information S is the spatial curvature of the universe. For simplicity, we haven't 
put spatial curvature into the metric. Nevertheless, we could imagine we live on a sphere instead of a flat plane. 

When the universe expands, the curvature radius 1Z should expand together, keeping 1Z = a(t) (with proper 
normalization of a). Nevertheless, as horizon problem (II), in matter or radiation dominated universes, the 
universe does not expand "fast" enough thus we still see more and more portion of the sphere as time goes on. 



6 The horizon here is defined in the sense of the above paragraph. This horizon is sometimes called the "cosmological horizon". The 
horizon size, in comoving coordinate, is l/(aH). This is not to be confused with the concepts of the particle horizon or the event horizon. 
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FIG. 3: The comoving scales return to the horizon in radiation and matter dominated eras. 



So far, we haven't seen any curvature (1Z > 10a). Thus at a much earlier time, 1Z has to be greater than 10a 
by a much greater amount. How could such a flat universe be prepared at the beginning, without the help of 
causal processes? 

This is not yet the end of the story. Some UV-completion theories of the particle physics standard model predict 
topological defects. From those theories, another question arises: 

• Monopole problem: Phase transitions of the early universe may produce gauge theory monopoles. If produced, 
they are supposed to dominate the energy density the universe. But we haven't found them. (On the other 
hand, global monopoles is not necessarily a problem.) 

So far we have outlined a few problems for the standard picture of the hot big bang cosmology. However, it is worth 
noticing that sometimes it is hard to judge the seriousness of conceptional problems: 

On the one hand, it is because those conceptional problems are paradigm dependent. For example, the monopole 
problem could be simply solved by starting with a theory without monopole solutions. One could as well argue that 
the universe could start from some almost deterministic initial conditions where the universe is extremely flat and 
uniform. Those theories exist, such as the quantum creation of universe j^Hj (see also [M[[25] for a different approach 
with different predictions). 

On the other hand, solution to some problems may not be complete, or may also raise other (though arguably less 
serious) problems. We will come back later to the new problems brought by inflation. 

However, there does exist physical problem of the hot big bang, without proper extension before the bang. This is 
the problem 7 of large scale correlations on the CMB (or in the large scale structure). 

• Correlation problem: Again consider Fig.[T] Actually the CMB map is not absolutely homogeneous and isotropic, 
but instead has small fluctuations on it, with an amplitude ~ 10~ 5 . Now consider the statistical correlation of 
those fluctuations, in regions A and B. Assuming nothing exists before the hot big bang, A and B shouldn't 
have correlations. However, those statistical correlations were caught, mode by mode in momentum space, in 
the CMB experiments [26]. With nowadays CMB map (WMAP:[27l [28] ACT:[29H3l] SPT:[32] [33]), one can 
actually observe by eyes that the fluctuations are very different from a map generated from white noise. 

Indeed, historically it is the verified predictions of inflation from the correlation in the cosmic perturbations, instead 
of the conceptual "postdictions" , that convinced the majority that inflation should be the theory of the very early 
universe and produce the dominant seed of CMB and structure formation. 

C. Inflation as a solution of problems 

Inflation is a period of time before the hot big bang, during which the universe undergoes accelerating expansion: 

a > . (8) 



7 Historically, the large scale correlation was no longer a problem of the bang, but already the success of inflation. This is because those 
correlations are detected after inflation was proposed. 
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For example, consider a constant equation of state p — wp. In this case acceleration implies w < —1/3. And as we 
shall see, inflation typically require w ~ — I (approach -1 from the greater side). 

Before going into detailed dynamics which could drive the above accelerating expansion, let us first have a look 
how inflation solves the problems in the previous subsection. 

• Horizon problem (I): From equation ([6]), we find that in case of acceleration (w < —1/3), r — r cx —it — 
i ) negatlve powcr . Thus small proper time interval t — to —> means negative and large conformal interval 

T — To — y — OO. 

Remember that t — To corresponds to the distance that light ray could travel, r — To — > — oo means that light 
ray could travel a very long distance during the w < —1/3 epoch (although could be a small interval of proper 
time (t — to)) before reaching the time of recombination. Thus Fig. [I] is replaced by Fig. |4j where the initial t\ 
hypersurface is stretched downwards in conformal distance, and light cones with tips on CMB becomes highly 
overlapped at tj. 

Alternatively, we can re-calculate the light propagation distance, similarly to Eq. Q: during inflation 

a(to)Ar = a(t ) _L ^L mu^ . (9) 

J tini a(t) H a{t in i) H a(t cnd ) 

This corresponds to an exponentially large distance, that light signals could have travelled during inflation. 

• Horizon problem (II) and the flatness problem: Those problems arises because a = aH was a decreasing function 
all the way since the creation of the universe (i.e. a < 0). Now we have a period of inflation a > before the 
hot big bang. Thus the horizon problem (II) and the flatness problem are solved provided that the a > period 
is long enough. This situation is illustrated in Fig. |4j where the left and right panels plots the same physics, in 
spacetime diagram and in terms of change of comoving horizon scale, respectively. 




FIG. 4: Inflation solves the horizon problem. Thanks to the accelerated expansion, the conformal time t is no longer tightly 
bounded by the big bang singularity t = 0. Thus the past light cones on the CMB overlap in the past, which solves the horizon 
problem. 



• Monopole problem: If monopoles were created before inflation, they could easily be inflated away. Thus it is no 
wonder we can not observe them today. (However, one should still pay attention not to produce gauge theory 
monopoles, domain walls or very strong cosmic strings after inflation.) 

• Correlation problem: How inflation solves this problem is discussed in detail in later sections. The key idea 
is that, the sub-Hubble quantum fluctuations are stretched by cosmic expansions during inflation, into super- 
Hubble fluctuations. Those fluctuations return to the horizon later after the end of inflation. Those returned 
modes cause large scale correlations on the CMB. This is illustrated in Fig. [4] 



comoving scales comoving scales 




FIG. 5: The comoving scales exits the horizon during inflation, and later returns during radiation and matter dominated eras. 

D. Dynamics of inflation 

Inflation is clearly a very powerful tool for solving problems with standard cosmology, so let us now proceed to see 
how inflation can be explicitly realized. The requirement p/p — w < —1/3 is not that easy to obtain. Because our 
familiar components in the universe have positive pressure. For example, consider a gas of particles in a box. The 
particles always "want" to escape the box, and hence generate positive pressure by hitting the surface of the box. 

The same difficulty can be also observed from the continuity equation p + 3H(l + w)p = 0. Again consider a gas 
component (but now in the whole universe). Both particle density and momentum of particles get diluted when the 
universe expands. Thus p should dilute at least as quickly as a~ 3 . Especially, when w ~ —1, we want a component 
whose energy density is almost not diluted when the universe expands. 

To have such an acceleration, one immediate idea is to use vacuum energy. This is because, the vacuum, after 
adiabatic expansion, is still vacuum 8 . Thus we have w = — 1 and the universe accelerates. 9 

However, if we use the vacuum energy to accelerate our early universe, problems arise: 

• Inflation must end. Our universe is not always in a state of acceleration. Observationally, significant parts 
of our directly observable history was radiation and matter dominated and the universe decelerates instead of 
accelerates in expansion. Theoretically, we need the comoving structures to return to the horizon. 

• The end of inflation must be controlled by a rather accurate clock. The overlapping of light cones in Fig. [4] 
does not guarantee (though makes it possible) that we observe an almost homogeneous and isotropic universe. 
For example, if one significant part of the universe has much earlier end of inflation, then energy density drops 
quickly in that part and it will produce a void in our universe, breaking homogeneity and isotropy. 

The vacuum energy itself does not provide an exit mechanism for inflation. Moreover, even if we add anything, like 
tunneling to another vacuum, as the end of inflation, it will not solve the problem as long as there is no clock. For 
example, the tunneling to another vacuum happened spontaneously instead of being controlled by a clock. 10 

Thus, we have to attach a clock to vacuum energy, and use this clock to end inflation. A clock 11 is something that 
takes a number at every physical point in the spacetime manifold, with the numbers increasing along an observer's 



8 Adiabaticity cannot be maintained for the vacuum wave modes as long as 1/H. We shall come back to this issue later in details. 

9 Indeed our present universe is accelerating, and the vacuum energy is the most popular candidate for our present expansion. For the 
discussion of late time acceleration, see e.g. 1341 and the references therein. 

10 The decay of false vacuum is actually the oldest model of inflation. However, apart from the clock problem, the decay of false vacuum 
has another more serious problem: Let V be the decay rate of the false vacuum. Then when F > H, inflation will end within one e-fold, 
thus do not really solve problems. On the other hand, if V < H, true vacuum bubbles can not percolate because the space between true 
vacuum bubbles expand too quickly. In this case we found ourselves in an open bubble [351 36], and the flatness problem returns. Thus 
inflation will either end too quickly or will never end globally. 

11 To be more accurate, by "a clock", we actually mean a set of clocks, accessible at every spacetime point. Thus the clock is actually a 
clock field. 
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world line. Thus a clock can be simply realized by a rolling scalar field 12 . In other words we would need an effective 
vacuum energy, controlled by the motion of a scalar field. 

The change from a vacuum energy to a rolling scalar field can be thought of as follows: A vacuum energy plus a 
massless scalar can be viewed in an alternative way as a scalar field lying on a flat potential. By slightly tilting the 
potential, we have a slowly rolling scalar field acting as an effective vacuum energy. This picture can be visualized by 
the analog of a rolling ball along a very flat surface, as shown in Fig. [6] The rolling scalar is called the "inflaton" . 

Further, inflation could be ended by adding some additional mechanisms triggered by some particular value of 
inflaton. For example, as in Fig. [6j after </> re h, the potential V(4>) is no longer flat thus inflation ends when <f> rolls to 
rc h. In this case, the inflaton field is the clock field of the universe. 




-TTT I 



4> 




FIG. 6: Left panel: a rolling scalar field is a modest deformation of pure vacuum energy. Right panel: the inflationary 
potential. The shape of the inflationary potential is largely unconstrained, as long as it is flat enough. Inflation ends at where 
the potential is no longer flat and a process of reheating follows to reheat the universe. 



To be explicit, the Lagrangian for a scalar field is 



£0 = 



--g^d^d u <t>~v{4>) 



In the homogeneous and isotropic background level, the above equation is reduced to 

1 



C 



:P ~ V{<t>) 



The equation of motion for (f> can be obtained from varying this action with respect to 

4> + 3H<]> + V'((f>) =0 , 



(10) 



(11) 



(12) 



where prime denote derivative with respect to <p. 

It is interesting to compare this equation with the flat space equation of motion for a homogeneous and isotropic 
scalar field: <j> + V'(<j>) — 0. The new term in Eq. (12) is a friction term. To see this, consider V'(4>) — 0. In this 
case the solution of <j) is <f> =const and <j> oc a~ 3 . The (j> =const solution means a "ball" which is at rest. The <f> ^ a ~ 3 
solution means that when there is no additional driving force, the initial velocity decreases. 

To solve Eq. ( 12 ), we need yet to know H. Thus we have to couple the <j> field to gravity. There are a few (essentially 
the same) ways to couple (j) to gr avit y. One could either vary the gravitational Lagrangian together with with 
respect to g^ Vl or vary in Eq. (10) with respect to g^ v to get the stress tensor 13 and insert this stress tensor into 



the Einstein equations. Instead, here we take a simpler route: 

The physical energy density can be obtained by a Legendre transform: 

p = H^/a 3 = + Vtf) . 



(13) 



Such a concept of global time appears in conflict with the principles of relativity. Because relativity asserts there is no preferred observer 
or absolute time. Indeed, Lorentz symmetry is spontaneously broken in cosmology, by the expansion of the universe. Nevertheless, 
spontaneous breaking of symmetry just means the state breaks the symmetry, which do not reduce the power or (arguably) beauty of 
the theory. 

Using this method, the stress tensor is T M ^ = (p + p)u l _ l u l/ + pg l iv, where = d^<f>/ \/— d u 4>9 l '<f>, and p and p are given in gg) and JTI}. 
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Compare the continuity equation p + 3H(p + p) = with the equation of motion ( 12 ), one concludes 

1 
2 



p = C^/a 3 = U 2 - V(cf>) . (14) 



Thus we have inflation when potential energy dominates. 
Inserting p into the Friedmann equation, we get 

3M 2 H 2 = P =^ 2 + V{4>) . (15) 



Now we are in the position to solve this system (with Eqs. ( 12 | and (pL5[) ) . It is in general difficult to find solutions 
for a general V. For some special potentials, the system can be solved. For example, one can make an ansatz 
w = pi p =const, and this case is called "power law inflation" [37] . resulting in an exponential potential of (f>. 

However, here we expect more general solutions. Thanks to the friction force, there is a slow-roll-approximatcd 



solution of Eq. ( 12 ) when the potential is flat enough, regardless the details of the potential. This is the solution 
that the fraction force balances the external force V (<j>), such that the acceleration of <\> does not change significantly. 
In addition, to have inflation, we want the potential energy V((j>) dominates over the kinetic energy \(\> 2 ■ Thus the 
approximation is 

^ 2 « V(4>) , 1^1 « mi>\ ■ (16) 

The first condition guarantees inflation because in this case p ~ —p. The second condition requires <f> to change slowly 
such that the first condition can be satisfied for long enough time. 



With the above approximations (16), given the value of 4>, we can first solve H by 



B *Jm- (17) 



and then insert the resulting H into 3H(f> + V' ~ to obtain 

V 



< 18 > 



After obtaining those solutions, we need to step back to verify the assumptions in Eq. (16). Inserting the solutions of 
H and <f>, we have 

M 2 V' V" 
e V = ^(y) 2 <l , W=M 2 —, (19) 

The subscript V means that those small "slow roll parameters" are defined using the inflation potential. This is 
convenient because by model construction, we typically start with a potential and want to know immediately if the 
potential is flat enough to have inflation. 14 

Especially, when inflation is driven by the mass of the inflaton, we have 

W = £ . (20) 

Thus r/v actually denotes the mass square of the inflaton, measured in Hubble units. The condition \rjy\ <C 1 means, 
the dynamics of the inflationary background, has an energy scale much smaller than Hubble parameter. As we shall 
see later, the inflationary perturbations actually lives on the energy scale H, with initial condition set on an even 
higher energy scale. 

It is also convenient to define another set of slow roll parameters using the dynamics of scale factor : 

e = -~, ?7=—. (21) 
H 2 He y J 

When slow roll conditions ey <§C 1 and \riv\ *C 1 are satisfied, those slow roll parameters are related to ey and r)y by 

e = e v + 0(el,rfr) , 77 = 4e v - 2 W + 0(e 2 v , V 2 V ) . (22) 
When slow roll conditions are not satisfied, the relations between (e, rj) and (ey, rjv) are non-linear. 



Note the appearance of Planck mass M p in those small parameters. It turns out quite hard to get those slow roll conditions satisfied. 
The convenience will show up practically in calculations like expanding the action with respect to perturbations 
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E. Attractor 



As we have shown, in the slow roll approximation, given a flat enough potential, there exists a slowly rolling 
inflationary solution. To take this solution seriously, it is important to consider the naturalness: Could this solution 
be reached easily without highly fine-tuned initial conditions? 

This is in general a hard problem. Because on the one hand, the dynamics before inflation is yet unknown. Thus 
the a priori naturalness is not well defined. On the other hand, even consider the mathematical naturalness, one 
need to consider a measure of the phase space. Unfortunately, the phase space measure for inflation divergences and 
different regularization method leads to different conclusions (551440) . 

Nevertheless the solution of slow roll inflation is an attractor, in the sense that a general point in the phase space 
of inflation typically flows onto a slow roll trajectory. An attractor solution is natural, at least in a naive way. 

There is an intuitive way to see this attractor behavior: the potential energy of inflation acts as an effective vacuum 
energy, and thus do not significantly redshift during inflation. However, the kinetic energy of inflaton tends to be 
redshifted away quickly. As discussed in the last section, without continuous injection of energy from the potential, 
the kinetic energy of <j) decays as et~ 6 . 

Thus, on the one hand, if the universe starts potential dominated, we are fine to have inflation already. On the other 
hand, if the universe starts kinetic energy dominated, with a small coherent potential energy 16 , the kinetic energy 
will get diluted away and the universe will eventually become potential dominated. Thus inflation is a dynamical 
attractor solution. 



F. Reheating 

As we already emphasized, it is important to end inflation. From local energy conservation, the effective vacuum 
energy should go somewhere. The simplest case is that this effective vacuum energy decays into matter or radiation, 
which heats the universe. This process is called reheating 17 . 

After reheating, the universe is radiation dominated 18 , thus 

M 2 H 2 ~ Pr ~ T r 4 ch , T rch cx JWpH , (23) 

where H is the energy scale at reheating. We require the reheating temperature to be compatible with BBN. Thus 
T re h > lOMeV. This is considered as a hard lower bound for inflationary energy scale H > 10~ 22 GeV. Nevertheless, 
we expect inflation to happen at much higher energy scales, because it is increasingly harder to build a flat potential 
at extremely low energy scale. 

If reheating is sufficient, most of the vacuum energy goes into radiation. In this case H is close to the energy scale 
of inflation. 

Unlike the dynamics during inflation, the dynamics of reheating is extremely model dependent. Here we shall not 
come into detailed physics (see [HISS for a review). Instead we only outline the generic class of models: 

• Perturbative decay: In this case, the decay is characterized by a decay rate T in particle physics 43-45] • A 
typical scenario of this kind has the inflaton oscillate around its minima of potential after inflation. And when 
the duration of oscillation is longer than 1/r, the inflaton energy decays into other particles. Note that the 
duration of oscillation is approximately t oc l/H at the decay time. Thus the inflaton decay happens on a 
uniform Hubble (and thus uniform total energy density) hypersurface in spacetime. 

In this case the reheating temperature can be written as T re h oc y/M p T. In case of a long time of oscillation, 
the Hubble parameter at the inflaton decay can be significantly lower than the inflationary Hubble parameter. 

• Non-perturbative decay: This case is model dependent. For example, brane inflation typically end by collision of 
brane and anti-brane [41>H4"5] . The decay is very fast due to the tachyon mode between the branes. The decay 
happens on a uniform inflaton hypersurface in spacetime. This situation is illustrated in Fig. [7j 



Actually a small coherent potential energy may be a big claim because the horizon problem is back. 

Reheating, as we can read, seems to mean the universe heat up for a second time. However, we don't know what happens before 
inflation, thus actually don't know if the universe was hot before inflation. 

It is still possible that the inflaton decays into matter, and the matter later decays into lighter particles which behaves as radiation. 
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FIG. 7: Brane inflation and reheating. When the brane moves close to the stack of anti-branes, the open string mode with 
end point on both brane and anti-brane become tachyonic, which triggers the decay of the brane. Brane inflation ends on a 
uniform inflaton hypersurface. 



• Parametric resonance (preheating): When inflation ends and the inflaton is oscillating around its minima, the 
oscillation may trigger a resonance for other particles (say, x) P§H52"] . In the resonance band, the energy density 
of x grows exponentially until x has comparable energy density with the inflaton, after which the system becomes 
non-linear while the decay continues. 

Note that during reheating, the small scale physics changes. However those scales are so small that the same 
comoving scale today is not cosmological. Moreover, those sub-Hubble scales shall not go super-Hubble again thus 
there is no associated conserved quantity. Thus it is harder to probe the physics of reheating than to probe the physics 
during inflation. 

Nevertheless, there do exist a few possibilities that can constraint (and in the future may observe) scenarios of 
reheating: 



• Modification of large scale fluctuations: As we shall show later in Sec. IV B 3 and IV B 4 in some special models 
of reheating, not only the small scale perturbations but also the large scale ones can get modified. 

• Gravitational waves: The gravitational waves propagate freely after they are produced. Thus the primordial 
ones carry information of extremely early ages [53] . Unfortunately the typical energy scale of reheating is yet 
smaller than the currently generation of gravitational wave detectors. Nevertheless, a low scale reheating or a 
table top detector may uncover the physics of reheating. 



G. Open questions 

We have discussed the dynamics of inflation, and its success. Before going on to discuss the further success of 
inflation at the level of perturbations, let us pause and mention a few open questions for inflation. 19 

• The rjy -problem |54j : As we discussed, for inflation to happen, we need ey <C 1. On the other hand, for inflation 
to happen for enough e- folds to solve any problem, we need (|y<l. For example, consider a free massive field 
as the inflaton. In this case, r\y <C 1 implies m <^ H. 

Note that H is a remarkably low energy scale: The inflationary H is high in our today's high energy physics 
point of view. However, during inflation X/H is the size of the whole "observable" universe at that time. Thus 
it is nontrivial that we need an inflaton, whose Compton wave length is much greater than the size of observable 
universe, during the era of inflation. 



Those open questions, from a critical point of view, are problems of inflation. 
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Even worse, the mass of a scalar field is not protected by quantum fluctuations. Like in the case of the Higgs 
mass hierarchy problem, the quantum correction to the inflaton mass runs to the UV completion scale A of 
the theory. Even if the inflaton field only has gravitational self coupling (or coupling to other fields), the mass 
correction, by power counting 20 , is 

A- 2 - ^ ■ (24) 
p 

When A ~ M p , the mass correction runs directly to Planck scale. 

As in the case of Higgs physics, we may expect supersymmetry to save the theory from serve fine tuning. 
However, the supersymmetry, if exists, has to be broken at the Hubble scale. Thus the cancellation leaves a 
mass correction Am ~ H. 

Also, a locally supersymmetric theory typically get mass corrections of order Am ~ H. For example, consider 
a Kahler potential K ~ c(j) 2 /M 2 , where c is a number of order one. The Kahler potential appears in the 
Lagrangian as 

e K V(<f>) ~ V&) + %V(4>) ~ V(<f>) + 3cH 2 c/> 2 . (25) 
p 

Thus an effective mass Am 2 — 6cH 2 is generated. 

There are also other sources to generate m ~ H corrections. For example, no symmetry forbids the coupling 
Xcj) 2 R in the action, where R is the Ricci scalar. Note that during inflation, R — 12H 2 + 6H. Thus coupling to 
Ricci scalar generates a mass correction Am 2 = 24XH 2 during inflation. 

As another example, if <f> couples to another field a with g(j> 2 a 2 coupling, the a field gets a vev at the fluctuation 



level (as we shall show in Sec. II A I 



<- 2 >-S => Am 2 .2 ff (. 2 ) = g. (26) 

The above problems are collectively called the ^-problem. To avoid this problem, one could introduce a shift 
symmetry to protect the mass of the inflaton. Nevertheless the shift symmetry has to be broken softly to have 
inflation [551155], 

The degeneracy problem: This is a problem at the opposite direction of the m/-problem. As we discussed, it is 
hard to get a flat enough potential, because of the r\y -problem. However, if we allow some fine tuning to flatten 
the potential, there are immediately a huge number of inflation models. We shall review those models in later 
sections. 

As we shall discuss later, the number of observables of inflation is limited. Thus it is considered very hard to 
break the degeneracy, to select the correct model of inflation from observations. 



The trans-Planckian problem 57-59 : Comoving scales expand exponentially during inflation. Thus those scales 
are exponentially small in the past. Especially those scales can be smaller than Planck scale at the initial 
time. 21 How can we trust a theory with such small energy scales? Especially how to set the trans-Planckian 
initial condition? 

It is argued that the vacuum state is adiabatic on sub-Hubble scales, and thus does not have this scaling problem. 
Yet it is useful to show explicitly if there is a problem or not. 

One should also note that at the level of free theory, the Planck mass M p has no physical meaning and can be 
rescaled away. Only when we consider interactions, M p becomes physical as the energy scale where gravity is 
strongly coupled. Thus the trans-Planckian problem is really a problem of interaction. And at the interacting 
level (which is by definition non-linear), there can be trans-Planckian modes interacting with the super-Hubble 
modes as well. This situation is illustrated in Fig. [8] 



The gravitaional coupling contributes Mp 2 . Thus there should be the A 4 factor to match the overall mass dimension. 

There need to be 10 more e-folds of inflation before the observable stage, for the smallest cosmological scale to reach Planck scale in the 

past. 
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FIG. 8: The trans-Planckian problem, as a problem of non-linear interactions. 



• Initial condition of inflation: Inflation cannot be the full story of the very early universe. The inflationary FRW 
metric only covers half of the de Sitter spacetime. As a result, inflation is not geodesic complete (a more general 
proof can be found in [60:). We need a start for inflation. This situation is illustrated in Fig. [9] It is an open 
question how to prepare a homogeneous and isotropic Hubble patch, for inflation to happen. 



^FRW — +00 




FIG. 9: The FRW patch is not geodesic complete. It covers only half of the global de Sitter space. 

There are some candidates for the theories before inflation. For example eternal inflation W2\ (which yet 
needs an initial condition), or the quantum creation of the universe |23H25j . So far we are still unable to 
experimentally verify those theories. 

From now on, we shall move on to the cosmic perturbation theory. This is the field where inflation has already 
shown its predictive power, and continues to excite the community with forthcoming experiments. 

II. LINEAR PERTURBATIONS 

Now we move on to the perturbations. Perturbations are by definition smaller than background. But for inflation 
(and many other cases in cosmology) they are by no means less important. Because a perfectly homogeneous and 
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isotropic universe do not tell us a lot (and cannot find anybody to tell). 



A. A spectator field in de Sitter space 



To warm up, we first consider a spectator field in de Sitter space (a = e Ht ) [S3]. "Spectator" means gravity is 
not dynamical. Instead, gravity is just an expanding playground and do not receive back-reaction from the spectator 
field. 

Operationally, one can consider a universe with a cosmological constant A. Then take M p — > oo while keeping 
A/Mp fixed. Then a finite energy density scalar field will be a spectator field. 

For simplicity, we consider the spectator field to be a real scalar, non-interacting and massless. The action of the 
spectator field is 



S = / d 3 x dt 



1 



1 



{d l( jf 



In Fourier space, the above action is 



S = 



d 3 k 



dt a 3 



a z 



The a field can be quantized as 



0ic(i) = v k (t)a k + v* k (t)a\ , 
where a k and its Hermitian conjugate are time-independent creation and annihilation operators: 

[a k , a v ] = 0, [a k , 4,] = (2vr) 3 5 3 (k - k') , 
and Vk satisfies the classical equation of motion of a: 

k 2 



i; fe + 3Hv k 



;V k = 



(27) 



(28) 



(29) 



(30) 



(31) 



To solve this equation, it is convenient to use conformal time. The same equation of motion now becomes 



(av k )" 



a 
a 



(av k ) = 



(32) 



where prime denotes derivative with respect to the conformal time. Note that a = —1/(Ht) for de Sitter space. Thus 
a" I a = 2/t 2 . The above equation has simple solution 



Vk = ci(l + ikr)e ikT + c 2 (l - ikr)e z 



(33) 



Now we have to fix the integration constants c\ and c 2 . Before a careful treatment, let us first consider a quicker 
way: Note that av k is the canonically normalized field. Thus when this fc-mode is far inside the horizon (r — > — oo), 
av k effectively lives in flat space and should have the same solution as in Minkowski space. By comparing with the 
Minkowski space solution 1 / V2k, we get 



ci 



H 

/2k 3 



c 2 = . 



(34) 

oo. When r > —aH, the 



Note that the mode function v k only coincide with the Minkowski space solution at r - 
solution of v k becomes completely different from the Minkowski one. 

Now we shall determine the above integration constants in a more careful way. There are two observations to 
determine c\ and c 2 : 



Quantization and uncertainty. The a field is quantized in Eq. (30). Note that in quantum mechanics, quantiza- 
tion and uncertainty principle have close connections. The size of quanta determines how accurate or inaccurate 
we could measure a system. 
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In our situation, the uncertainty principle is 



7r k = ~ = a 3 a_ k , fa, 7r k ,] - l (27r) 3 ^(k - k') . (35) 

dcr k 



To compare this relation with (301, we get a normalization condition 

a 2 (v k v* k ' - v' k v* k ) = i . (36) 

Note that we have rewritten the time derivative in terms of conformal time. Applying the equation of motion 
for v k , we obtain 

H 2 

cic* - c 2 c* 2 = . (37) 

• Vacuum has lowest energy. Now we are to fix a "vacuum" state. There are two things to do to fix the state: 
First, we should relate the vacuum to annihilation operator. A "vacuum" is a state satisfying 

a k |0) = . (38) 

However, this condition is actually a precise definition of ak, among all the possible definitions. More importantly, 
we still need to know what is a vacuum as a physical state. We define the vacuum as the lowest energy state at 
initial time r — > — oo. To make it more precise, the Hamiltonian (defined with respect to t) is 

■H= X - [a 4 a 2 + a 2 {d l a) 2 ] = ^a 2 ( Cl c* + c 2 c*)(fc 2 + fcV) + 1 [a 2 Cl c* 2 e~ 2tkT (k 2 + 2ik 3 r) + c.c] , (39) 

where c.c. denotes complex conjugate. At r — > — oo, the Hamiltonian is 

-n^^p{cic\+c 2 c* 2 )k 4 . (40) 



We want this Hamiltonian to be minimal with the constraint (37). Thus we get 



|Cl1 = ' C2 = °- (41) 

Note that only the amplitude of c\ is determined. The phase of c\ is still not determined. This phase is a 
random phase, reflecting the quantum mechanical nature of the fluctuations. Eventually this random phase 
shows up, for example, on the CMB sky as the randomness of temperature fluctuations. In statistical quantities 
such as the power spectrum, this random phase do not appear. 

To collect the result of quantization, we have 

<Tk(r) - v k (r)a k + v* k { T )a\ , v k (r) = ^=(1 + ikT)e~ lkT . (42) 

So far we have discussed two ways to fix the vacuum state: Instant Minkowski vacuum, as well as the initial 
lowest energy state. This vacuum state is known as the Bunch-Davies vacuum [64] . The same vacuum state 
can also be obtained in various other ways. For example, the Bunch-Davies vacuum can be obtained by the no 
boundary wave function of the universe. This is because the Euclidean version of initial time, it = — oo, is a 
regular point on the sphere, thus shouldn't result in a divergence in the Euclidean action. Alternatively, it is 
also believed that the Bunch-Davies vacuum is an attractor solution of inflation. Quite general (but not all) 
initial states are eventually driven to the Bunch-Davies state due to the cosmic expansion. 

Before ending the discussion of vacuum, some more comments are in order here: 



The relation between Eq. ( 38 ) and minimalizing the Hamiltonian: At first glance, they look like the same 



condition. This is because, in flat space QFT, a k annihilate one particle. So Eq. (38) means there is no particle, 
and thus is lowest energy state. 22 However, on time dependent background, things become more complicated. 



In fiat space QFT, we still need C2 = to relate a k to the concept of "annihilate one particle" . 
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On a time dependent background, matter energy is not globally conserved. A "empty" state without particle 
could evolve into a state with lots of particles in it. Note that we work in the Heisenberg picture. Once we have 
defined the "vacuum" state, the state does not evolve with time. However once we define the vacuum state as 
the lowest energy state at one time, the same state could be no longer the lowest energy state at another time, 
due to the time dependence of the matter Hamiltonian. Here, we minimalize the energy at t ^ — oo, as one 
essential piece of vacuum selection. 

• The vacuum is selected in the UV: Note that r — > — oo corresponds to the fc-mode has almost zero wave-length 
and infinite energy. Thus in this procedure, the vacuum is selected in the UV. The k/a — > oo limit can not be 
reached in QFT because QFT is considered as an effective field theory with a UV completion. The UV completion 
scale is at most Planck scale. As we have discussed, this is the "trans-Planckian" problem of inflation. One 
might expect the selection of initial state at k/a —¥ oo could be relaxed to a selection of lowest energy state at 
k/a — > A, as long as H <C A <C M p . Before k/a = A, the universe can be considered as expanding adiabatically 
and no particle is produced. 

• The vacuum is selected linearly mode by mode: We first linearized the theory, and consider one fc-mode without 
worrying about other modes. In other words, we implicitly set all other fc-modes to zero. In flat space QFT, wc 
can safely do that because in scattering process we indeed have set other modes to zero, at least locally near 
the incoming wave packet. However, in a time dependent background, the situation is actually not clear. The 
perturbations of other wave length, especially longer wave length exist and we cannot just forget about them. 
The inclusion of those modes will give a contribution at higher order in perturbation theory (fortunately not at 
the leading order). 

Now we have determined the vacuum state, using a simple matching together with a more detailed discussion. The 
next step is to interpret and understand the solution. Suppose we could measure the scalar field a after the end of 
inflation 23 , what shall we see? 

We see random fluctuations and we want to extract statistical information from it. The way to do it is to consider 
the two point correlation function (cri-ov) at r ~ (the end of inflation). Inserting the solution of the mode function, 
we have 



where P a (k) is called the power spectrum. \fP„ is the fluctuation amplitude of a per Hubble volume per Hubble 

time. When P a (k) does not dependent on k (as the current case), we call the power spectrum "scale invariant". 

The scale invariance of a power spectrum means if we zoom the fluctuations into a bigger volume, and compare 
with the true fluctuations in this bigger volume, statistically we cannot find difference between those fluctuations. 
The "zoom" operation shows up in the above formula as the k 3 factor, as a normalization of the volume. 

Note that in our present example, the dynamics of the perturbation is linear. Linearity implies that all the odd-point 
correlation functions (e.g. (cr^a^a^)) vanishes, and all the even-point correlation functions (e.g. (ck 1 0'k 2 0"k 3 0'k 4 )) 
can be reduced trivially into their disconnect parts (e.g. (a^a^) (a^ak^+peimutations). Thus the power spectrum 
carries all the statistical information about the perturbations. Considering the unpredictable nature of quantum 
random phase, P a (k) is the unique function to connect theory and observations. As wc shall show in later sections, 
in case of a non-linear theory, the above is no longer true and the predictions can become much richer. 

The above discussion can be generalized to a massive spectator field. In the massive case, the above discussions 
can be directly generalized. The equation of motion takes the form 





(43) 




v k + 3Hv k + —v k + m 2 v k = . 



(44) 



a 



The solution, with the same method of vacuum choice, is (up to a phase) 



v = 



2 



(-kr) 



(45) 



Actually at this time the modes arc supcr-Hubble and not measurable. Nevertheless, wc suppose we could measure those super-Hubble 
modes at the moment. We shall return to this topic when we discuss the conserved quantity. 
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where H„(x) = J v (x) + iY v (x) is the Hankel function of the first kind. In the k/a max( \fmH, H) limit the 
solution returns to the same Minkowski limit as discussed in the massless case. 

Finally, we heuristically apply the above discussion of a to the fluctuation of the inflaton. This "application" must 
be considered with a big caution that, considering that inflaton is the field that drives inflation, the inflaton fluctuation 
couples to gravity dynamically. Thus it is dangerous and non-rigorous to assume the fluctuation of inflaton lives on 
a non-dynamical gravitational background. We will leave those issues to the next subsections. 

Now we postulate that a is the fluctuation of the inflaton field: 

0(x,t) = to(t)+ty(x,t) , (46) 

where 5<p(x, t) is small and has the same solution as a. 

Return to the intuitive picture of <j> as a clock. During inflation, on super-Hubble scales different parts of the 
universe temporarily lost casual contact with each other. It is the clock field within the Hubble patch, instead of the 
impact from other parts of the universe, that controls the reheat time of the local universe. When (f> has a fluctuation 
50, 5<f> brings an uncertainty to the clock. The time uncertainty, measured in Hubble unit, is £ = —HSt = —H5<j>/<f). 
This time difference may be conserved, considering different patches of the universe do not talk to each other. If we 
consider this £ as an observable, the power spectrum of C is 

w 2 u A n 2 
Pf= I Lp, = H (47) 

We shall see in the next sections how the above argument could be made rigorous. 



B. Decomposition of the metric 



In the last section, we considered a spectator scalar field. In that case we could operationally take M p — > oo keeping 
p a / (M 2 H 2 ) to be small. However, this limit can not be applied to the inflaton. This is because from the Friedmann 
equation, the inflaton energy density is exactly ~ M 2 H 2 . Thus gravity may not decouple 24 . 

To consider the inflationary perturbations, we need to perturb gravity as well. It is not enough to promote the 
scale factor a(t) in the FRW metric to be spacetime dependent. Instead, we have to consider the full metric, with the 
full dynamics of gravity. The dynamics of gravity is governed by the Einstein-Hilbert action 



M 2 

p 

Y 



s 9 = ^r I d 4 xV=g R ■ (48) 



To calculate it explicitly in terms of gravitational perturbations, we have to write the metric into an explicit form. 
Instead of writing g^ v in terms of gocb ffoij S02, • • • , it is convenient to write the metric in the ADM form |65j . without 
loss of generality: 

ds 2 = -N 2 dt 2 + h l3 {N l dt + dx^N'dt + dx?) . (49) 
The convenience that the ADM form of the metric brings include: 

• Clear physical meaning. The meaning of each component is demonstrated in Fig. |10| 

• Convenience of calculation. Various quantities can be calculated easily using the ADM decomposition. For 
example, the determinant of the metric is 



-g = N^/h . (50) 



And the inverse metric can be written as 



_ f -N 2 +N k N k NA ( -l/N 2 N^/N 2 . 

/, ' r ^ Nj h,, ) 9 ~ \ N j /N 2 h ij ~ N i N j /N 2 ' ' 11 

where Ni = hijN^ , and h i3 is the inverse matrix of hij (note that in general h 1 ^ ^ g 1 ^). 



One can still dump the major part of to an effective cosmological constant. It is the time variation of that invalids decoupling of 
gravity. Thus one can expect the gravitational corrections (per e-fold) to the inflaton are slow roll suppressed, compared with the case 
of a spectator field in de Sitter space. As we shall show it is indeed the case. 
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X lj rdx 



FIG. 10: The ADM decomposition of the metric. The green arrows are the components of the metric, namely, lapse of time 
Ndt, a shift of hypersurface due to the time flow N l dt and the spatial distance dx 1 . The distance N r dt + dx 1 is first measured 
by 3-metric hij, and then together with Ndt, makes ds. If dx' = 0, the time part themselves sums up to dt M , along the flow 
direction of time. 



• Decomposition of the Ricci scalar. The 4-dimensional Ricci scalar is much more difficult to calculate than the 
3-dimensional one. The ADM decomposition provides a connection between the 3- and 4-dimensional Ricci 
scalars. The derivation can be found in e.g. [66] . Here we outline the results: 

One can construct normal of the hypersuface as 



,00 



% = 0,0,0)=-$/-/=^ 
The extrinsic curvature of the hypersurface can be written as 

1 



ri l = (l/N,-NyN) 



(52) 



(53) 



This K^ v can be calculated explicitly as 

K°" = , 



K f . 



1 

2N 



V4N4 



where T>i is the 3-dimensional covariant derivative, defined from h^. 
The 3- and 4-dimensional Ricci scalars are related by 

R = (3) i? - K 2 + K$Kl ~ 2V M (-Kn* 1 + n'"V u n li ) 



(54) 



(55) 



Note that there is a total divergence term in the RHS. For Einstein gravity, this term is a boundary term. In 
the case there is no spatial or temporal boundary, this term can be dropped 25 . 

• Constraints easily solvable. Note that in the above setup, the Einstein-Hilbert action has no derivative acting on 
N, and no time derivative acting on N l . Thus N and TV* are constraints and easily solvable. The disappearance 
of derivatives on N and N l are not accidental. The reason is that gravity can be viewed as a gauge theory 
and there are constraints to be solved. This can be also understood from Fig. 10 N and N l only tells how to 
define (instead of dynamically evolve) the spatial hypersurface at the next instant of time. Thus they should be 
non-dynamical. 

One can still decompose the ADM metric further. Note that the 3-dimensional rotational symmetry is explicitly 
preserved during inflation. Thus it is convenient to decompose the metric components according to the irreducible 



Actually, even there is boundary, those terms should also be canceled by explicitly adding boundary terms |67l I68| . Otherwise the 
variation principle of the action is not well defined. 
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representations of the rotational group. The relevant representations are 3d scalar, 3d vector and 3d tensor repre- 
sentations. Thus it is convenient to decompose the metric into those components. At linear level (i.e. calculation of 
two point function), those components do not mix with each other, because different representations are orthogonal 
to each other. 

For this purpose, we decompose 26 

N t = drf + h , hij = e 2 <a 2 [Sij + 7y + V,V 3 E + V t F 3 + VjF,] . (56) 

Here the vector sector contains hi and Fj, satisfying 

V t W = , V l F i = . (57) 

The tensor sector contains 7^ , satisfying 

V i7 v = , 7? = . (58) 

The scalar sector contains N, /3, £ and E. 




FIG. 11: The meaning of £, E and hij. C i s an overall rescaling. E rescale the points with respect to its original radius to the 
origin, hij rescale the points with respect to its own direction. 

From the orthogonal theorem in the theory of group representations, modes in different representations do not 
propagate into each other. For example 27 , (C( x )lij{y)) = 0- 

The above scalar, vector, tensor perturbations need to be considered together with matter perturbations, for 
example, 8<p of the inflaton. Here we shall consider the scalar sector of the perturbations in detail. The tensor sector 
looks similar to the scalar sector and we shall only discuss it briefly. The vector sector decays in most inflation 
scenarios. Thus we shall neglect the vector sector here. 

C. Gauge invariance and gauge fix 

Einstein's theory of gravity is invariant under general coordinate transformation. The action (and thus physics) do 
not change under a change of coordinate 

x^ = x fi + ^(x) . (59) 

The metric itself transform as a tensor under this coordinate transformation. Thus components in the metric, N, (3, 
£, • • • , indeed change under this transformation. Those transformations can be written collectively as 

5 t g aP {x) = g^(x) ~ g aP {x) = g MV (x) - (6» + d a ^){8} + d^Jg^x + £) . (60) 



At higher order of perturbations, the 7^ part defined here may be not very convenient. See Maldacena's definition. 

However, note that if we want to calculate one loop contribution of {(^(x)^(y)) , we do need to consider the loop contribution from jij. 

Also, more-than-two-point functions can be mixed. For example, (C( x )C(y)7ij ( z )} 7^ 0- 
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Scalar fields also change under the gauge transformation as 

S^6(x) = $(x) - 6(x) = 4>{x) - 4>{x + . (61) 
Those transformations can be grouped into two parts: the scalar part £° and £, and the vector part £ l , where 

C = € + 0*£ , diC = . (62) 

Those two parts transform the scalar sector and vector sector respectively. The tensor sector is gauge invariant by 
itself. 

For example, under a shift of time £° = St(x), and at linear order, £ and 56 changes as 

t = ( + H6t, Ts = 56 - <p a St . (63) 

Consider a featureless spacetime with all those perturbations N, ft, £, • • • being zero. If we perform a gauge 
transformation on this spacetime, we find those perturbation quantities no longer zero. Thus there appears to 
be perturbations. Those apparent perturbations are called gauge artifacts, which need to be eliminated from the 
calculation. For example, a gauge artifact 5t brings pure background spacetime into 

( = H8t, <5^ = -0 o <5i. (64) 

There are two ways to eliminate those gauge artifacts. One way is to use gauge-invariant formalism. In this 
formalism, the gauge artifacts cancels within the gauge-invariant quantities. For example, at linear order, one can 
choose the gauge invariant quantity as the Mukhanov- Sasaki variable 

C gi = Q + H 5 -t. (65) 

However, one need to note that free of gauge artifact do not necessarily mean the effect is observable. We still 
need to identify theoretical observables with what one actually observe in experiments. The situation is particularly 
dangerous for non-linear perturbations. This is because at the non-linear level, the short wavelength fluctuations, 
which are accessible by local observers, can be mixed with long wavelength perturbations, which are not accessible 
by local observers (but still gauge invariant) [6"§1 170] . 

A related issue with gauge invariant variable is that, at next to leading order, the gauge invariant quantities are 
non-local and non-unique |71j . 

Here we follow a simpler way: to fix the gauge 28 . Once the gauge is fixed, there are no longer available gauge 
transformations thus pure artifacts cannot appear. 

It is convenient to choose a gauge with E = 0. We have yet another gauge condition to choose in the scalar sector. 
This additional gauge condition fixes spatial slicing. There are a few frequent choices. They give the same result of 
observables. On the other hand each gauge has pros and cons: 

• The 56-gauge (spatial flat gauge): We can choose to let the spatial part of the (scalar sector) metric to be flat: 
C = 0. This gauge is convenient to discuss sub-Hubble scale physics. Because on small scales (sub-Hubble, but 
not as small as Planck scale), gravity is weak. Thus it is intuitive to think about gravity as decoupled. Also, 
the calculation of the action is simplest in this gauge. 

• The (,-gauge (uniform inflaton gauge): We can choose to let the infiaton field have no fluctuation 29 : 56 = 0. 
This gauge is a comoving gauge, in that the observer moves together with the cosmic fluid and measures no 
matter flux. 

Note that the three dimensional curvature is 

{3} R = . (66) 



The observables, for example the number of photons found in the detectors, are gauge invariant. However, most quantities we talk 
about, which we have intuition for, are gauge dependent. Those gauge dependent but intuitive quantities include field fluctuations S<p, 
and the fluctuations in the metric. Thus it is still convenient to consider an observer and consider the intuitive picture of the world 
understood by this observer. 

Note that all scalar fields behave similarly under gauge transformations. Thus in the case of multi-field inflation, although we have two 
gauge conditions in the scalar sector, we could only set one scalar field to zero (even with E ^ 0). 



22 



Thus C is the perturbation of ^R. For this reason, the C variable in the C-gauge is known as the comoving 
curvature perturbation. 

This C-gauge is convenient to discuss the super-Hubble scale physics. Because on the one hand, gravity accumu- 
lates. Thus on large scales it is more intuitive to think about the perturbations as C instead of 8<f). On the other 
hand, as we shall show later, £ is conserved in the C-gauge for single field inflation. This provide a connection 
between the inflationary physics and current observations. 

However, on small scales, (-gauge is not intuitive. Let's consider an analogue situation: A table has different 
height from the ground (if we choose Euclid coordinates). And in our room, gravity is not too strong. However, 
from general covariance, we can choose a coordinate system where the table and the ground has the same 
coordinate height. However, in this coordinate system gravity is extremely strongly coupled to cancel the same- 
height artifact. In the C-gauge, on small scales the situation is similar: our intuition gets mixed up by a small 
physical effect plus a big gauge artifact (this comment also applies for <5</>-gauge on super-Hubble scales). 

Technically, we shall indeed find that in the C-gauge, we have to do a lot of integrations by parts to cancel large 
numbers in the gravitational action. 

• The Newtonian gauge: We can choose to let the shift vector vanish: (5 = 0. In this gauge one can rewrite the 
scalar sector of the metric as 

ds 2 = -(1 + 2<i>)dt 2 + a 2 (f - 2^)dx 2 . (67) 

Here <E> and \1/ are gravitational potentials. The Newtonian gauge allows us to think intuitively about how 
particle moves in the gravitational field: $ is the Newtonian potential that non-relativistic particles move in 
Newtonian mechanics. Relativistic particles could feel both $ and In Einstein gravity, constraint equations 
yield \& = and a departure is a smoking gun of modified gravity. For those reasons, the Newtonian gauge is 
very convenient in analysis of the late time universe. 

• Mixed gauges: One could even choose one gauge at one time, and another gauge at another time. As we 
discussed, the 50-gauge is nice before and at the Hubble-exit, while the C-gauge is nice after the Hubble exit. 
Thus one can choose (\k=aH = 0, and <50| ro hcating = 0. This can be accomplished by the partition of unitary. 
There are infinitely differentiable functions which vanish at k = aH and equal to 1 at reheating, and vise versa. 
For example, 

'<*> " {%• , x > o • «*) = m l%. x) • aw = 1 - aw • m 

Here fi(x) = when x < and fi(x) = 1 for x > 1. The metric 

^-crossing ^reheating)/^crossing ; 

(69) 

takes 50-gauge at Hubble-crossing and takes C-gauge at reheating. Despite the complexity of gauge condition, 
we shall find the usage of this gauge when reviewing the SN formalism. 

It is useful to note that, sometimes those intuitive gauge choices do not completely fix the gauge. For example, 
in the C-gauge, there is still residue global symmetry, known as the large gauge transformation [72]. For single field 
inflation, 

x -> X = xe A , C -> C(x) = C(x) + A , (70) 



x^S = x + 2(b-x)x-x 2 b , C-K(x) = C(x) + 2b • x , (71) 

For multiple field inflation, the change of other fields should be considered together. One could further fix this residue 
gauge (see, e.g. [73]) but we shall not need those further fixings at the moment. 



D. The 2nd order gravitational action, <5</>-gauge 



Now we are in the position to calculate the perturbations in the gravitational theory. There are typically two ways 
to do this. One way is to start from the classical Einstein equation, and perturb the Einstein equation to the linear 
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order. However, the cosmic perturbations are quantum in nature. As we observed in the spectator field case, we need 
to know how to normalize the field canonically (how large one quanta is). Thus it is better to start from the action. 
In the action approach, the first order action is just a variation principle of the background and thus vanishes. To get 
the linear equation of motion, we need the second order action, and the variation principle on top of that. 

We start from the <5</>-gauge. In the 5^-gauge, we set ( — and E = 0. The perturbation variables in the scalar 
sector are 8<j), a, f3 defined as: 



{x)-Mt), N = l + , 



The gravitational plus inflaton action is 

S-- 



d A Xy 



M 2 R 



(72) 



(73) 



To calculate the Ricci scalar R by hand, one had better go to the ADM formalism Eq. (55 1 and drop the total 
derivative. Or if a computer algebra system is used, it is technically easier to start directly from the 4-dimensional 
Ricci scalar. 

In the ADM case the 2nd order action in momentum space takes the form 



So = dt 



d 3 k 
' (2tt) 3 



a 3 {e - 3)M 2 H 2 a 2 + 2k 2 M 2 aHa/3 - a 3 V' at 
+ U 3 W 2 ~ la(k 2 + a 2 V")6q 



a 3 <j)oaS(j) — k 2 a<pof3S(f> 



2 " 2 

In 4-dimensional Ricci scalar case, the 2nd order action in momentum space takes the form 
d 3 k 



(74) 



dt- 



(27T) 



2a d (3 - e)M 2 H 2 a 2 + 6M 2 a 3 Haa 



k z M^aHafi - k z M^aa(3 - k^M^aafi - a A V aS 
\a{k 2 + a 2 V")5^ 



a 3 (j>oaS(j) — k 2 a(f)o(35<f) 



2 



(75) 



The above two actions can be related by some integrations by parts, as desired from the boundary term of (55). Note 
that in the action we have not written the k-dependence of the perturbation variables explicitly. Those k-dependence 
should be understood as, e.g. 2a 3 (3—e)M 2 H 2 a 2 <^> 2a 3 (3— e)M 2 H 2 a\s_a^. The aka-k part follows from momentum 
conservation. 

From Eq. (74), we observe that there is no time derivative acting on a or /?. This is as expected from the general 
principles we discussed when we introduce the ADM formalism. Thus a and (3 can be solved: 

1„xa ^ _ _ 



2H 



2Hk 2 



a 2 (f)ol 



(76) 



Actually we shall not need the solution of f3 when only the linear perturbations are considered. This is because 
Eq. (74) is linear in (3. Thus inserting the solution of a into Eq. (74) eliminates a and j3 together. Then after an 
integration by part to eliminate the 



S= dt 



d 3 k 
'(2tt) 3 



term, we get the second order action 
2 1 



-a 3 5(j) — 2^ 2a 



l -a 3 [V" -2H 2 (3e-e 2 +a 1 )] <^ 2 j 



(77) 



A few comments are in order: 



• The 3rd term in Eq. (77) is a mass term for the perturbations. The mass is m 2 



n - V" - 2H 2 (3e -e 2 + er 1 ). Due 
to gravitational corrections, this mass is different from the naively expected m 2 — V" . Nevertheless they are of 
the same order. 



• Note that this mass term is rather small. To see this, we recall that the energy scale for inflationary perturbations 
is H. And m 2 cS = 0{r])H 2 . 
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• Could this small mass be neglected? The answer is actually subtle. Although m e g <C H, it may take effect 
on super-Hubble scales. For the mode corresponding to the current cosmological scales, the mass term takes 
effect for about 60 Hubble times (the observable e-folding number of inflation). The same subtlety can be also 

observed in that, the 6<p term in the action would vanish exponentially on super-Hubble scales, and eventually 
become comparable with the "small" mass term. We for the time being neglect this mass term, and leave the 
subtleties to the next subsection. 

(Doubtfully) neglecting the mass term, the discussion for the 5<p perturbations become completely parallel to the 
discussion of a massless spectator scalar field on de Sitter background. The power spectrum of the S(f> perturbations 
can be calculated as 



(78) 



E. The 2nd order gravitational action, £-gauge 



As we have discussed in the previous subsection, in the (50-gauge, perturbations could evolve on super-Hubble 
scales. This invalids the naive slow roll approximation. Actually the problem is more serious: Inflation takes place at 
an extremely high energy scale. The physics from after inflation until TeV scale is unknown. How could we evolve the 
inflationary perturbations (or any other observables produced by inflation) through this unknown epoch and make 
predictions for inflation? 

Fortunately, for single field inflation, the ^-perturbation in the C-gauge is conserved on super-Hubble scales. We 
shall show the conservation at linear level in this subsection, and at non-linear level in the later sections. It is the 
conserved perturbation on super-Hubble scales, that brings connection between the very early time physics and very 
late time physics, after Hubble-reentering. 

For this consideration, it is important to consider the perturbations in the C-gauge. Here we set Scf) = and E = 0. 
Inserting the perturbations into the gravitational plus inflaton action (73 1, we get using the ADM Ricci scalar: 



So = Mi 



dt 



d 3 k 
(2tt)8 



a 3 (3 



e)H 2 a 2 



(2aHk 2 f3 + 2ak 2 ( + 6a 3 H()a - 2k 2 a(iC, 



3a J C ~ 18a d #CC + o(-9(3 - e)a 2 H z + k z )C 



(79) 



Or from the 4-dimensional Ricci Scalar: 

d 3 k 



S 2 



M 2 
p 



dt- 



(2tt) 3 



2a 6 (3 - e)H 2 a 2 + {-k 2 a/3 - 9a A HQ - 3a s Qa 
+ (6a 3 Ha + k 2 aHfi - k 2 af3 + a(2k 2 - 9(3 - e)a 2 H 2 )( - 12a 3 H( - 3a 3 ()a 
- 2k 2 aP( + {36a 3 H( + 9a 3 C)C + 6a 3 C 2 + ^a(9(3 - e)a 2 H 2 + 2k 2 )( 2 



(80) 



Again they are related by integration by parts. The constraints can be solved as 



c 

a= H 



1 H k 2 



(81) 



Insert the constraints into the 2nd order action, and do some integration by parts to eliminate cross terms. Eventually 
one finds the action to be in a very simple form 



Mi 



d 3 k 
'(2tt) 3 ' 



dt^^e{a 3 C 2 



k 2 aC) 



(82) 



Note that unlike the case of <50-gauge, now there is no mass term for the C-gauge perturbations. Thus the awkward 
super-Hubble evolution problem no longer exists in the C-gauge. There is a decaying mode and a constant mode for 
C- After a few e-folds after Hubble exit, the decaying mode goes away and C becomes conserved. We shall verify this 
conservation argument explicitly at the end of this section. 

It is useful to note two facts: 



• No slow roll approximation is assumed in the above derivation. Thus the above action applies even e ~ 1. 
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• In general, even if the cosmic fluid is not a scalar field, as long as p = p(p), one can choose the £-gauge. 

Thus the conservation of £ should hold after inflation (with the assumption of p — p(p)). This answers the question 
at the beginning of this subsection: The conservation of £ makes inflation predictive. 

Now we are in the place to calculate the quantum perturbations of £. The equation of motion of £ is 

^'+(3 + 77)^ + ^ = 0. (83) 

To the lowest order of slow roll approximation, the rj factor in the above equation can be ignored and approximately 
a oc e Ht . The solution of the above equation, as a function of conformal time, is 

C=— ^==(l + ifcr)e- ifcT . (84) 

In terms of the quantized fields, the above solution is equivalent to 

Ck = Mfeflk + u* k a -u , u k = t=e=(1 + ikr)er lkT . (85) 

MpV 4efc 3 

As in the case of a scalar field in de Sitter space, the "negative frequency" mode is disregarded and the constant 
factor is determined from the consistency of quantization together with the assumption that the initial state of the 
k-mode has lowest energy. 

Before closing this subsection, we would like to verify the conservation of C on super-Hubble scales, without the 



reference of slow roll or inflationary background. Note that (85) is an exact equation without making slow roll 
approximation. 



On super-Hubble scales, k <C aH thus the (k 2 /a 2 )( term can be neglected in (85). Thus the equation of motion 
can be approximated as 

C + (3 + 77)#C^O . (86) 

Considering the exponential growth of a, this equation is exponentially accurate on super-Hubble scales. The solution 
is 

C(t) = Cl +c 2 J dt x expj- J 1 [3 + ri(t 2 )]H(t 2 )dt 2 j . (87) 

Thus there exists a constant mode c\. The other mode is decaying as long as 77 > —3. This inequality is satisfied by 
most of the cosmological conditions (note that 77 = for a constant equation of state parameter w, and 77 ~ for slow 
roll inflation). 30 



F. Power spectrum, scale invariance and tilt 



Similarly to the case of massless scalar field, the power spectrum of comoving curvature perturbation can be defined 

as 

(CkC> = (2^3 (k + k ^ Pc(fc) , Pdk) = £^ = J^. (88) 

The observational value of the power spectrum is Pr(k) ~ 2 x 10 -9 . This value 31 is known as the COBE normalization 
[25] . Sometimes one also use A-r defined as = y/P^. 



However, it is interesting to note that in case jj < —3, £ finds itself in an effectively contracting universe, although the real metric is 
expanding. 

The value P((k) ~ 2 X 10~ 9 is the power spectrum on large scales, spanning between the scale of the whole observable universe, and 
scale about e~ 10 smaller. On the other hand, there are constraints that the power spectrum should be perturbative on small scales all 
the way down to the comoving Hubble scale at the end of inflation, by not observing the primordial black holes (see [74| for a review, 
and the references therein) formed by those small scale fluctuations. 
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Again, the power spectrum does not depend on k at the lowest order. Thus the inflationary perturbations are 
nearly scale invariant. 

Although the power spectrum is nearly scale invariant, it is still important to consider the small non-scale- invariant 
correction to the power spectrum. This correction is called the tilt of the spectrum, or the spectral index. According 
to the WMAP result, the exact scale invariant power spectrum is almost ruled out and the derivation from scale 
invariance should be a few percents. 

To describe the scale dependence, the spectral index n s is defined as 



1 _ dlnP ( {k) 



dink • < 89 > 



To calculate this quantity, it is helpful to have a closer look at the equation of motion (83). There are two ways 



to take the scale dependence into account in ( 83 1 . The most straightforward way is to consider the small derivation 
from a cx e Ht together with no longer neglecting 77. The equation of motion can be solved in this case and a derivation 
from scale invariance is observed |75[ I76| . 

On the other hand, there is an easier way to calculate the scale dependence. Weak scale dependence means that the 



parameters in (83) would vary by a small amount per Hubble time. Within a few e-folds near the Hubble crossing, 
Eq. ( 83 1 should still be applicable, with H considered as a constant and rj neglected. A few e-folds before the Hubble 
crossing the solution is linked with the instant Minkowski solution, and a few e-folds after the Hubble crossing the 
solution is linked with the conserved quantity. Both the sub-Hubble instant Minkowski solution and the super-Hubblc 
conserved quantity are not slow roll approximated but instead exact result. 

In other words, the prescription should be to take the Hubble parameter in the equation of motion as a function 
of momentum k: H — H(k), and calculate the value of H at Hubble crossing k = aH. Thus up to higher order slow 
roll corrections, we have dink ~ din a = Hdt. The spectral index can be calculated as 

dlnPAk) , . 

n s - 1 = -§} ' =-2e-r, = -6e v + 2 Vv . 90 
Hat 

The observational constraint for n s — 1 is [77] 

n s - 1 = 0.9603 ± 0.0073 (68% CL) . (91) 

Thus although inflation is nearly scale invariant, an exact scale invariance with n s — 1 is ruled out at over 5er. 
One can also define the running of spectral index, as 

dlnn* 

a ' = -dmi ■ 

For the above single field inflation model we are considering, we have 

a s = -2 V e - A . (93) 
Hr\ 

Thus a s is one order smaller than n s — 1 in terms of slow roll parameters in the above simple model. The observational 
constraint for a s is [77] 

a s = -0.0134 ± 0.0090 (68% CL) . (94) 



G. Comoving curvature perturbation as the e-folding number difference 

As we discussed at the end of Sec. |II E| on super-Hubble scales the equation of motion simplify. Actually, the 
k/(aH) parameter itself can be used as an expansion parameter. One can set up perturbation theory with respect 
to k/(aH), which could be exact to all orders of C or the slow roll parameter. This method is called the gradient 
expansion. 

There is an intuitive way to view the gradient expansion, at least at the leading order kj (aH) — > 0. During inflation, 
the universe broken up into exponentially many "local universes", which temporarily cannot talk to each other. Each 
local universe is quite homogeneous and isotropic and thus can be locally approximated as a FRW universe. 

The local properties of those local FRW universes have no difference between one another. However, when we 
compare those local universes in spacetime, there could be a difference of expansion between them at a fixed time, 
or a difference of time at a fixed rate of expansion. This difference can be produced by the quantum fluctuations. 
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FIG. 12: Time difference between local FRW universes. After reheating, the universe become radiation dominated and the 
energy density dilutes quickly with the expansion of the universe. The change of color illustrate the drop of energy density. 



For example, in one local universe, if there is a quantum fluctuation to drive the inflaton a little bit up on the 
potential, this local universe is then driven farther away from reheating. As we shall show, this difference is actually 



the comoving curvature perturbation. This situation is illustrated in Fig. 12 

It is useful to make this separate universe idea explicit. There are a few ways to do this and here we follow the 
method of [78 . Consider a local universe with scale factor 

S(t,x) = a(i)e c(t ' x) , (95) 

where C(^ x ) is almost a constant on scales comparable with the Hubble scale, and only change significantly on 
super-Hubble scales. The local Hubble expansion can be defined as 32 H = a/a. The continuity equation is 

p + 3H(p + p) = . (96) 
In terms of a and this equation can be written as 

a 3 p + p 

So far we haven't specified any gauge yet. Now we shall look at this equation in the uniform energy density gauge. 
(It can be shown that on large scales this gauge is equivalent to the comoving gauge.) If there is no entropy per- 



turbation, i.e. p = p(p), then in Eq. (97) only £ is a perturbation variable. The precise definition and meaning of 
entropy perturbation will be discussed in Sec. |IVB 1 All other variables are background variables, independent of x. 



Considering a perturbation variable should not have zero mode, we have 

C = . (98) 

For yet another time we confirmed the conservation of comoving curvature perturbation on super-Hubble scales. And 
this time the confirmation is exact for all orders of perturbations in C, (but assuming that Q is classical, both in external 
lines and in loops. For a quantum treatment, see [79]). 

Now let us make connection between £ and the perturbation of e-folding number. Note that this part is independent 
to the above discussion, in the sense that we do not assume p = p(p). Thus the result also applies for a time dependent 
£ on super-Hubble scales. 



Note that the time derivative here can denote a general time instead of restricted into physical time. The lapse function cancels in this 
equation and later related equations. 
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The connection is intuitive because £ is the perturbation of scale factor, and on the exponent. To see this connection 
more precisely, we note that the e-folding number of a local FRW universe can be written as 



N(t 2 ,t 1 ,x)= I dtH = I dt[-+()=ln 



a 



a(ti 



+ C(t2,x)-C(ti,x). (99) 



Again this equation does not dependent on gauge choice. Now we choose a mixed gauge: At an initial time ti, 
C(ti) — (flat slice), and at a final time t 2 and later times, p = p{t) (uniform energy density slice). In this mixed 
gauge, we have 

C(ta,x) = N{t 2 , t u x) - N(t 2 , h) = SN , (100) 

where SN is the e-folding number difference between the initial flat slice and the final uniform energy density slice. 
This relation is known as the 5 N- formalism 80 - 821. 



Finally, if needed, the gauge-invariance can be restored by adding back the gauged counter-part of (100): 



C&Ht) = 5N(t) + ~ (101) 
3Jp(t) P + P 

where p is the background energy density, which equals to the energy density on uniform energy density slice, p is 
the energy density without choosing a gauge. 

As a side remark, the above discussions could actually be made stronger. Consider there are n components of fluids 
(say, pi, i = 1, . . . , n) in the universe. Assuming those components do not exchange energy between each other. Then 
the energy of those components are conserved separately. In this case we have n conserved quantities: one can define 
Q as the SN on uniform pi slices, respectively. The above discussion holds component-wise. 

The SN- formalism is convenient in several aspects: 

• It is straightforward to calculate the inflaton fluctuation on the initial flat slice. As we have shown, the inflaton 
fluctuation at Hubble crossing is S<f) — H/(2n) plus calculable slow roll corrections. 

• The comoving curvature perturbation is conserved after the final uniform energy density slice, as we have shown. 

• For single field inflation, the uniform energy density slice is the reheating surface. Thus the SN defined in the 
above way does coincide with the e-folding number difference during inflation. 

• The <5-/V-formalism is non-linear in orders of £. Thus the formalism is still convenient to use for non-linear 
perturbations. 

The convenience can be observed by a sample calculation. During inflation, the e-folding number can be calculated 
as N(t) = J t tloh Hdt, A variation of this equation is 

C = SN = -H5t=S^ . (102) 



Thus 



This agrees with the power spectrum we have calculated in ( 88 ) . 

However, there is price for the simplicity. One has to pay attention to the assumptions of the above derivation, and 
be careful at the situations where those assumptions break down. For example, 

• Non-conservation of When there are entropy perturbations, p is no longer a function of p thus £ may not be 
conserved on super-Hubble scales. 

• Non-trivial reheating surface. When there are entropy perturbations, the reheating surface may not be the 
uniform total energy density slice. In this case SN no longer has the physical meaning of e-folding number 
difference of inflation. Instead, a correction term coming from the difference between uniform energy density 
slice and the reheating surface must be considered. 
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• Nontrivial initial correlation functions. If the power spectrum or higher order perturbations of S(j) is modified 
at Hubble-crossing, the modification needs to be considered as an input of the <5iV-formalism. 

• Ignorance of the decaying mode. The requirement p — p(p) is not the same as the requirement of single field. 
This is because a scalar field has second order equation of motion and thus two modes. On super-Hubble scales 
one mode decays away thus p = p{p). However, in case both modes needs to be considered for some reason, one 
need to examine if p = p(p) still hold. 

• Validity of local FRW metric. This is typically not a problem for inflation. Because locally, non-FRW energy 
components are diluted. However, for alternative models to inflation, this is not necessarily true. 

• Break down of classicality. At the non-linear level, quantum fluctuations on sub-Hubble scales may interact 
with the super-Hubble fluctuation. <5 TV-formalism is classical and do not carry this quantum nature. 

Finally, let us make one more clarification of the gradient expansion method and the ^TV-formalism: There exist 
sub-Hubble fluctuations. When considering local FRW universes, why we neglect those sub-Hubble fluctuations? 
Actually, those sub-Hubble fluctuations, as high energy modes in a general QFT consideration, can be integrated out. 
As a result, the background equation of motion for the inflaton acquires a stochastic source term |83j : 

4> + 3H<P + V' = ^-H 5/2 rj(t) , (104) 

where rj(t) is a Gaussian noise obeying 

(vihHh)) = S(h - t 2 ) . (105) 
This is exactly the origin of the Scf) = H/(2n) fluctuation at Hubble crossing. 

H. Observable inflation, de Sitter space and eternal inflation 

So far we have been talking about the observable stage of inflation. When calculating the dynamics of inflationary 
perturbations, we have been using the lowest order slow roll approximation. Especially, when calculating the equation 
of motion for the perturbations, the mode functions are the same as that obtained in (the FRW patch of) de Sitter 
space (see our toy model of spectator field). 

However, despite of the approximations, we should be careful in thinking about the observable stage of inflation as 
de Sitter space. 

For example, one note that the power spectrum of comoving curvature perturbation is cx H 2 /(eM 2 ). When 
e — > with H fixed finite, the power spectrum diverges. This is very different from the observed power spectrum 
P c ~ 2 x 10~ 9 . 

The problem not only arises as the divergence in the number. But also, C ( or other observables stands for density 
perturbation of inflation) manifests itself only after the end of inflation and after the perturbation modes returns to the 
horizon. The exponential expansion of de Sitter space never end thus one can not observe the density perturbations. 

Even worse, in de Sitter space, the "super-Hubble" density perturbations are not only non-observable. The whole 
semi-classical picture of cosmic perturbation theory may break down in the de Sitter limit. It is argued that, like the 
black hole complementary [81] , de Sitter space may have complementary as well [55] . 

Consider two comoving observers A and B, who start near each other with distance nearer than Hubble. Because 
of cosmic expansion, at a later time, B will exit the horizon of A. The horizon exit process of B, as seen by A, is 
a disaster of B. The observer A measures the Gibbons-Hawking radiation with a temperature T ~ H. Blue shift 
this temperature to the near horizon regime, A actually finds B destroyed by the Gibbons-Hawking radiation when 
approximating the de Sitter horizon. This picture is different from that experienced by B, who finds nothing special 
because of the equivalence principle. The different semi-classical pictures observed by A and B contradicts. But this 
is not a problem as long as the observers no longer return to each other's horizon to compare notes. 

Applied to cosmic perturbations: for a local observer, the physical picture of perturbations in de Sitter space should 
be that, the perturbations approaches the de Sitter horizon, get burnt by the blue shifted Gibbons-Hawking radiation, 
and get emitted back by the horizon. There is no other copy of information which actually "exits" the horizon. 

Thus the physical picture is different between the perturbations in a stage of observable inflation, and that in de 
Sitter space. The change of physical picture actually not only happens at the exact value e = 0, but takes place when 
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e becomes so small that P^ > 1. Note that the power spectrum can be also written as P^ oc H 2 / (2ir(f>) 2 . Thus when 
P( > 1 we have 

I < I ■ 

The LHS is the classical motion of inflaton per Hubble time, and the RHS is the quantum fluctuation per Hubble 
time. Thus the above equation can be interpreted as that, the quantum fluctuation of the inflaton dominates over 
the classical motion. 

The classical motion of the inflaton always roll down the potential in the slow roll regime. However, the quantum 
fluctuations has equal probability to fluctuate up or down. After one Hubble time, a Hubble volume self-reproduces 
into e 3 ~ 20 copies. If in one or more of those 20 new Hubble volumes, the inflaton field has greater value due 
to the large quantum fluctuation, the dynamics in those patches are driven away from the reheating surface. The 
self-reproduction continues and inflation will never globally end. This situation is known as eternal inflation [ST1 162) . 

During eternal inflation, when following an observer's comoving world line, the observer finds that inflation eventu- 
ally ends locally. However, due to the lack of global end, the global structure of eternal inflation is analogue to that 
of de Sitter space. 

As in the case of de Sitter space, the semi-classical picture of cosmic perturbations break down during eternal 
inflation. Again this is not only because of Pq > 1, and not only because of lack of global end of inflation, but also 
that the whole picture cannot be trusted. 

To see this, it is convenient to consider (the slow roll analog of) the static patch of de Sitter space, instead of the 
FRW patch. In the static patch, the metric can be written as 

ds 2 = -(1 - H 2 r 2 )dr 2 + dr \ , + rW . (107) 

1 — H z r z 

The stretched horizon is defined as the surface with one more area unit compared with the classical horizon: 



rsn = ^ W2 ~ Wr (108) 

This definition is reasonable because stretched horizon tsh, the Gibbons-Hawking temperature is the Planck temper- 
ature. The temperature has a redshift 

T dr 1 . . 

T7 = / = ; ■ (109) 

H y/-ds 2 VI - H 2 r 2 V ; 



Setting T = M p , one get the tsh in equation (108 1. Complementarity becomes important as soon as objects (e.g. 
perturbation modes) reach the stretched horizon. 

Now we want to estimate when complementarity becomes important for slow roll inflation. Roughly, we could use 
the result for pure de Sitter space, and let H vary with respect to time. Now the field excitation propagate towards 
the stretched horizon and the stretched horizon is also expanding itself. One should compare which of the two goes 
faster. 

The field excitation near the stretched horizon falls towards the stretched horizon at speed of light: 

dr H 2 

Measured by an observer at the center, the stretched horizon itself moves at a velocity (in terms of r and r) as 

dr SH dr S n d ( 1 \ 

" SH = ^ = ^r^UJ =e - ( } 

Here one should note that v is local process near the stretched horizon thus there is a huge redshift factor, however, 
i>SH is measured at the center of static patch, thus dr = dt when calculating i^sh- 
To compare those velocities, one note 
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where is the power spectrum for curvature perturbation during inflation. For the observable stage of inflation, 
~ 2.4 x 10~ 9 . Thus during the observable stage of inflation the field excitations would never reach the stretched 
horizon. Thus the apparent difference of the static patch and the FRW patch is merely a coordinate transformation. 
However, for eternal inflation, Pf > 1. Thus during eternal inflation the Hubble radius is expanding so slowly that 
the field excitations could reach the stretched horizon. 

On the other hand, even one believed the semi-classical picture of eternal inflation, the density perturbation gener- 
ated during eternal inflation could never be observed even in principle. This is because when the £ ~ 1 modes returns 
to the horizon, they form black holes. The global structure of spacetime is not accessible by an observer in a black 
hole [IS]. 

To conclude this subsection, when taking the de Sitter space limit, one hits eternal inflation and a shift of semi- 
classical physical picture. The calculations of power spectrum and non-Gaussianities are not affected (although the de 
Sitter approximation is used for the mode function), because the perturbations never reaches the stretched horizon. 



I. Gravitational waves 



The inflaton is not the unique light field during inflation. Instead, there is at least another light field - the 
gravitational waves [87l - f89l| (for modern conventions, see, e.g. [90]). The calculation of gravitational waves goes 
similarly to the scalar sector. One could either do a brute force calculation of the tensor part, or first calculate the 
Rlcci scalar on Minkowski background and then do a conformal transformation into an inflationary background. The 
result is 

s= ^Iw^ dTa2 ^'- k2 ^ ■ (113) 

Note that the traceless and transverse conditions eliminates 4 components from the 6 components of a symmetric 
3-tensor. Thus there are only 2 physical modes in 7*. Those two modes can be decomposed by introducing the 
polarization tensors 

7«(k) = [7+(k)4(k) + 7 x(k)e l x J .(k)] , (114) 

where the symmetric, traceless and transverse properties are carried over by ej- and e*- as 

4' X =4' X > ^=0, e+- x =0. (115) 
Further from the reality conditions we have 

e+' x (-k) = (e+> x (k))* . (116) 
And we have the freedom to normalize e^' x . The normalization we take is 



Finally, we need to fix the ambiguity of decomposition in Eq. (114). For this purpose, it is convenient to choose the 
two polarization tensors as the spin 2 eigenfunctions of rotation along the k-axis: 

e±^e™e±, e* ^ e" 2 ^* . (118) 

Those conditions fixes the function ej' x (k) as a function of k (up to a phase). 

It is helpful to construct those polarization tensors in an explicit way. To do that, we make use of the rotational 
symmetry of the theory to first rotate k to be (0, 0, k). Then the polarization tensors are 





For a general k, a 3-d rotation matrix is multiplied onto the above polarization tensors. 
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With the above decomposition, the action of the tensor mode becomes two decoupled components. Theoretically 
this is because they are different irreducible representations. Technically this is because efj(e* k )* = 0. The tensor 
mode action takes the form 

S=\J j^dr a 2 [(</ +7+ ' - fc 2 7+7+ ) + ( 7 ' x7 x' - A: 2 7x7x )] . (120) 

For each polarization mode, the action is the same as that of a massless spectator scalar field. Thus 



P 7 + ' X H^) • ( 121 ) 



The tensor power spectrum is defined as 



W'( 7ii (k) 7w (k')) = (2^) 3 <5 3 (k + k')^P 7 , (122) 



Compared with Eq. (114), we have 

o tt2 

F ^^w r - < 123 » 

To compare with the scalar power spectrum, a tensor to scalar ratio is defined and calculated as 

p 

r = J = 16e = I6e v . (124) 

The observational constraint on r is [77] 



r < 0.11 (95% CL) . (125) 



It is important to note that, unlike the scalar power spectrum (88), the tensor power spectrum (121) does not 
depend on e. In other words, P+' x is solely determined by H, the energy scale of the inflationary perturbations. Thus 
if the gravitational wave power spectrum is detected, we immediately know the inflationary Hubble scale. 

Also, for the minimal single field inflation, the slow roll parameters ey and rjy can be calculated as functions of 
observables n s and r: 

£ v = {q, Vv = \{n s - 1) + y^r . (126) 

Thus measurements on n s and r can be considered as measurements of the first two derivatives of the inflationary 
potential. Especially, the sign of r\y determines if the inflationary potential is convex or concave. The latter is favored 
by current data |77j . One can plot the single field inflation models (as a function of e- folding number) as line intervals 
on a two dimensional n s -r plot. 

The spectral index of gravitational waves is 

n T = = -2e = -r/8 . (127) 

Note that both the very LHS and very RHS of the above equation are expressed in terms of observables. Thus 
iit = —r/8 is a consistency relation for single field inflation. 

It is also interesting to note that, the tensor to scalar ratio is related to the distance that the inflaton rolled in field 
space during inflation. The inflaton's rolling distance per e-fold is 

_^ = v_ = _P^r 128 ) 
dN H 4 y ' 

a. 



Thus in 60 e- folds, cf> rolls a distance A<p ~ 15M 2 y2r. In words, large tensor to scalar ratio r indicates a longer 
distance of inflaton's motion. This relation is known as the Lyth bound [91] . One can correspondingly classify 
inflation models into large field and small field models, with A<fi > M p and A</> < M p respectively. 
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J. Other fields 



As we have seen, the construction of inflation requires two types of light fields: the inflaton field and the gravitational 
waves. However, we know from QFT that there are many other fields. What do other fields behave during inflation? 
Here we discuss several possibilities: 

• Gauge fields: Non-interacting gauge fields in 3+1 dimension have a conformal symmetry. The action of a 
non-interacting gauge field can be written as (take U(l) as an example) 

S= -\ [ d^x^gg^g^F^F^ . (129) 



4_ 

Note that F^ v — d^A v — d u A^ does not really contain metric or covariant derivative. Moreover, the scale factor 
dependence (using conformal time) in \J—g and g^ a g v ^ cancels. Thus the gauge field does not "know" it lives 
on an expanding background instead of on a Minkowski background. This is due to the conformally flat nature 
of the FRW metric. As a result, the large scale fluctuations of the gauge field is suppressed by k 2 /a 2 and thus 
not important or observable. 

On the other hand, if for some reasons we want the gauge field to have large scale fluctuations, then we have to 
break the scale invariance of the field (see, e.g. [9"2"]). 

• Accidental, or other protected massless fields: In this case, our previous discussion of a spectator field in de 
Sitter space applies. During each e-fold, the field has a fluctuation a = H/(2tt). Thus on long time scales, the 
field has a random walk behavior o~(x) — H \fTTi / (2tt) . 

However, one should note that, it is not easy to have a massless field without a symmetry. This is because, 
without the protection from symmetry, the quantum corrections generates a mass for the field. For the same 
reason as the Higgs mass hierarchy problem, the quantum correction for a unprotected mass parameter runs to 
the UV completion scale of the effective field theory. This generates a mass correction Am 2 ~ A 4 /M 2 . If there 
is supersymmetry (which has to be broken at Hubble scale), the mass correction is brought down to Am ~ H. 

• Field with a small mass m <C m^: The dynamics of such a light field depends on the field value. If the field 
is located at the origin of the potential, there is no much difference between the current case and the case of 
m = 0. However, as we discussed, because of the random walk behavior, the field cannot naturally stay at the 
minimum of the potential if there were a large amount of inflation before the observable stage. To estimate the 
field value, we note that there is a competition between the random walk and a classical roll-back of the field. 
The classical roll-back can be estimated using the field equation 

ZHa ~ -V' = -m 2 a . (130) 

We may approximate H to be constant (considering that ct is not the field that drives inflation, which modifies 
H). In this case, a can be solved as 

( m 2 At\ 

a cx exp I — J . (131) 

Thus at large field value, the field rolls back exponentially fast and cannot random walk. On the other hand, 
when the exponential factor is small, random walk dominates. Given enough e-folds of inflation (including 
inflation happened at the non-observable stage), the balance is reached at 

At ~ ^ . (132) 

During the above time (corresponds to H 2 /m 2 e-folds), the field randomly walks by a distance 

TT TT2 

Act x VHAt . (133) 

27T 2tTTO V ' 

This is the typical distance for ct, away from the origin. On large scales, the very light field forms a curvaton 
web [93] . The dynamics of the curvaton web can be analyzed in detail using the Langevin equation and the 
Fokker-Planck equation. If there were no enough e-folds of inflation, a cutoff of e-folding number needs also to 
be imposed. 
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At perturbation level, the present case share similar equations with the massless case. However, due to the 
non-trivial background motion, the conversion mechanism from entropy perturbation to the comoving curva- 
ture perturbation could be very model dependent. The definition of entropy perturbation together with the 
illustrations of mechanisms are discussed in Sec. IIVB II 

• Field with a small mass m ~ : 

This case is again different from the m <C tua, case. This is because the a field now rolls fast enough that it 
does affect the variation of the Hubble parameter. Thus both fields contribute to the background evolution. In 
this case we have multi-field inflation. 

• Field with a mass <C m < H: In this case, the a field is heavy compared with the background energy 
scale - the inflaton mass. However, a is not heavier than the energy scale of perturbations - the Hubble scale. 
Thus the field stays at the origin of the potential at the level of homogeneous and isotropic background, but the 
perturbations could couple to the inflaton and thus observable. In this case, we have quasi-single field inflation. 

Note that if there is minimally broken supersymmetry during inflation, then the standard model fields should 
get a mass correction of order H. This is known as the "?yv-problem" for the inflaton |54j . and opens up a 
variety of possibilities for other fields. 

• Massive fields with m ^S> H : Those fields decouple from the light fields at the tree level. However, considering 
that inflation is UV-sensitive, it is still an open question if loop corrections would make the imprint from those 
very massive fields observable in the sky |94j . 

III. NONLINEAR PERTURBATIONS 

As we have shown in the previous section, linear perturbations already carry interesting information about the 
physics during inflation. However, the information that linear perturbation carries is quite limited. This is because, 
for each field, the power spectrum is a function of k only. Even worse, nearly scale invariance brings this function 
down to a few numbers with increasing difficulties to measure. 

For example, consider the comoving curvature perturbation £. Currently we have measured two numbers: the 
amplitude Pq itself at a given k, and the tilt n s — 1. If there were no features in the power spectrum, even a third 
number (the running of spectral index a s ) would be very difficult to measure. The gravitational wave sector is even 
worse. Even the amplitude is not detected yet, leaving only an upper bound. 

If the above two numbers are the only things that one could get from inflation (which actually comes directly from 
the nearly scale invariance of the background, not the detail dynamics), then it is very hard to distinguish between 
inflation and alternatives, or between inflation models, let alone to use inflation as an ultra high energy laboratory of 
physics. 

One expects quite a few possibilities to overcome the above lack of prediction problem. For example, a detection 
of tensor mode will be of much help. We shall also discuss other possibilities such as features and entropy modes in 
later sections. Here, we consider another class of possibilities - interactions, which produce non-Gaussian fluctuations 
in the sky. 

A. Interaction picture and in-in formalism 

To calculate the interactions, it is convenient to use the interaction picture 33 . Let us start by reviewing how the 
Hamiltonian formalism and the interaction picture are defined on a time dependent background (95j . 



Historically the advantage of interaction picture perturbation theory versus the old fashioned perturbation theory (Lippmann-Schwinger 
equation) is that the former explicitly preserves Lorentz invariance. In the cosmological background, there is a preferred time direction 
anyway. However, it is nontrivial to find the eigenstates which diagonalize the time dependent Hamiltonian for perturbations. This is 
the major difficulty for Lippmann-Schwinger equation in cosmic perturbation theory. 
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(134) 



1. The Hamiltonian for the perturbations 

Given a theory, we have a Hamiltonian H [<fi(x, r), 7r(x, t)] 3435 , with equal time commutator 

[<Mx, t), 7T{,(y, r)] = iS ab S 3 (x - y) , [<£ (x, r), </> 6 (y, r)] = , [vr Q (x, r), 7r 6 (y, r)] = , 

where a, 6 indices denote different fields or field components. The equation of motion can be written as 

4>' a = i[H,cj> a ] , n' a =i[H,ir a ] , (135) 

where as usual, prime denotes time derivative with respect to the time t. To do perturbation theory, we need to split 
(j) a and 7r Q into background and perturbations. The background is assumed to be c-number valued functions which 
satisfy the classical equation of motion 36 . For our purpose, typically it is convenient to define the background as the 
homogeneous and isotropic part of the fields, which are the FRW metric and the zero mode of the matter fields. The 
perturbations <5</> a (x,r) and <57r a (x, r) are define by 



(j)a (X, T) = (j) a (T) + 5<p a (x, T) , lT a (x, T") = 7T a (r) + 5lT a (x, t) . 

The commutators for perturbations are not affected by the c-number valued background. Thus 

[6<f> a (x, t), <57r b (y, t)] = iS a bS 3 (x - y) , [Sc/> a (x., r), 5<j) b (y, r)]=0, [Sir a (x, t), 5vr b (y, r)] = 
The Hamiltonian can be expanded with respect to the perturbations as 



SH[0,ir] 



(x,r) 



6H[<l>,n] 



^7r(x, t) 



<57t(x, t) + H[5(j), Sir; r] 



(136) 
(137) 

(138) 



where the terms which are linear in <5</>(x, r) and <57r(x,r) vanish due to background equation of motion. The term 
H[S(f>, 5tt;t] is the Hamiltonian for perturbations, which is of second and higher orders of 5<p and Sir. Note that 
H[5(f>, Sir; t] is explicitly time dependent because now we write it as a functional of 5<j) and Sir, and the time dependence 
coming from </>(£) and 7t(t) appears as an explicit time dependence. 

One can verify the consistency of taking H [8(f), Sir; r] as the Hamiltonian for perturbations, by calculating from 
Eq. Jl5l): 



H,S<f> a 

H, STT n 



where U(t, to) is the time evolution operator, satisfying 37 

d T U(t,T ) = -iH[5<l)(T ),5Tr(T );T}U(T,T ) 



^ (T) = f/- 1 (T,r )^ a (ro)C/(r ) To) , 
6ir a (T) = U~ 1 (T,T )5ir a (T )U(T,T ) , 



U(t ,t ) = 1 



(139) 



(140) 



2. The interaction Hamiltonian 

The Hamiltonian H [5(f), Sir; r] contains quadratic terms and higher order terms in 6(f) and Sir. The quadratic terms 
result in linear equations of motion, which typically lead to simple solutions. However, the non-quadratic terms in 
H[5<fi, Sir; t] results in non- linear equations of motion, and in most cases cannot be solved exactly. Perturbation theory 
is in position to solve this problem. 



34 The x dependence in the Hamiltonian is a functional dependence, because the Hamiltonian has an integration over x. On the other 
hand, the Hamiltonian inherits implicit time dependence from </>(x, t) and 7r(x,r). 

35 Here we haven't specify the meaning of r yet. Nevertheless, we shall mainly have the conformal time in mind thus we denote it as t as 
we did in previous sections. 

36 Note that there may be no unique classical background for some inflation models (e.g. [96 97 ). In this case, the definition of in-in 
formal ism b ecomes tricky. 

37 Prom ( 139 1 we can directly get d T U(T,ro) = — iH [<5</>(r), 8it(t); t]U(t, to) . And note that the canonical equations |139i implies = 
§pp , thus H[5<P(t),5tt(t);t] = H[84>(t()),8tt(to);t]. This is how | |140[ | is derived. Later we shall use this trick again in 145| l. 
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For this purpose, we split H[6<P,Stt;t] into 

H[6(j), Sir; t] = H [8<t>, Sir; r] + Hr[84>, Sir; r] 



(141) 



where Hj[S<p, Sir; r] is the interaction Hamiltonian. According to the split of H and Hj, we define the time evolution 
of the interaction picture fields 5<f>^ as 



8<£ = i[H ,84t] 
5ni' = i[H Q ,5ni] 



84>1(t) = U^(T,T )S^ a (T )Uo(r,T ) 
8ni(r) = U^^to^MUo^to) 



(144) 



where Uq(t, tq) is the time evolution operator of the free theory, which is supposed to know explicitly from a given 
Hq via 



d T U (r,To) = -iHolScf) 1 (t ), Sir 1 (t );t]U (t,t ) , U (t ,t ) = 1 



(145) 



To fix 8(1)^ completely, we have to choose the initial condition as well. Formally, this can be done as in the flat space 
QFT, by requiring that at some initial time To, 

Hi(To) = Hai T o) , ^a( T o) = Sir a (T ) ■ (146) 

Combining Eqs. ( |140[ ) and (145) (which implies d T (JjQ l ) = iUq Hq), and making use of ( |146[ ), we have 

OAUoHr^Uir,^)] = -iU^^T^HrWfo^faiTohTpfaTo) = -iU^Hr, t )U(t, T )fl/[^(r), 8tt(t); t] . (147) 
As one can directly check, the solution of this equation is 
F(t,to)=Uq\t,t )U(t,t ) 

=1 + (-*) / dn Hi(n) + H) 2 / rfn / 1 dr 2 h i (t 1 )h i (t 2 ) 

J TO J T J Tq 

+ (-if f dn ^ dT 2 f 2 dT 3 H I (T 1 )H I (T 2 )H I (T 3 ) + ■■■ 



--Te 



-i r Hj{T')dT' 



(148) 



In the last line, the time ordered integrals have been compactly written into a time ordered exponential. However the 
exponential still needs to be calculated order by order in the form of the second and third line. 
Now an operator in Heisenberg picture can be expressed via the interaction picture field as 



Q(t)=F~ 1 (t,t )Q i (t)F(t,t ) 
where T is anti-time ordering. 



ij^H,(T')dT' 



Te J -o 



Te 



-i { T HAT')dT 



(149) 



Note that this split is just for the convenience of calculation and thus contains some ambiguities in it, which shouldn't affect physics. 
For example, in the case of field with modified sound speed (which we shall discuss in detail in later sections), we have 

H = J d 3 x X -a 2 [{a') 2 + c 2 k 2 a 2 } , a' = n^/a 2 (142) 

We could solve H[8(P,8tt;t] directly, which is a straightforward generalization of c B = 1 case. Or instead, when |c s — 1| <C 1 we could 
split H[8c/>, 8iz; t] into 

H = J d 3 x l -a 2 [(a') 2 + k 2 a 2 } , H z = J d 3 x l -a 2 [k 2 (c 2 - 1)<t 2 ] . (143) 
The two methods of calculation agree as they should at zeroth and 1st order of |c s — 1|, with the usage of first order perturbation theory. 
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3. The time evolution of the vacuum 



As in flat space QFT, F(t,tq) is the time evolution operator of the quantum states in the interaction picture. In 
the aspect of quantum states, there is one state of particular importance: the "vacuum" state. 

We have discussed the vacuum state in section [II A| This discussion is reused for a few times for the inflationary 
scalar and tensor perturbations. However, here new ingredients come in, because of the interactions. In flat space 
QFT, there is already a difference between the interaction vacuum and free vacuum. The most part of this section is 
parallel to flat space QFT. Comments are made when the expansion of the background brings a difference. 

Let us use |0) to denote the vacuum state of the free theory, which is defined by annihilation operators. We use |ft) 



to denote the interaction vacuum, which is to be constructed using |0). As discussed in section II A it is important 
to note that the vacuum is the lowest energy state. 

We first expand |0) using a complete set of energy eigenstates of the full theory. On expanding background matter 
energy is not conserved and we have yet to define gravitational energy. Nevertheless, we could consider the energy 
eigenstates at an initial time for the perturbation modes. 39 Near tq —> — oo, we have 

e - iH ( T -T )^ = J2e- iH ^- T ^\n){n\0) = ^ e~ lE - {T - To) \n){n\Q) = e - jEfi(r - To) (ft|0) + ^ e- lE "' (T - Tn) \n')(n'\0) , 

n n n' 

(150) 

where in the last line nf denotes the states excluding the ground state. Now we add a small imaginary part to time: 

T ->T = r(l-ie) . (151) 

Near tq —> — oo, the ie factor introduces a term e -°°x eE ». Thus only the |f2) state survives from this ie factor. 40 Now 
| ft) can be solved as 

|n> <njo) — F(r ' To)|0>= <njoj • (152) 

From now on, we shall neglect the tilde on r. In other words the r in later sections should be understood with a small 
imaginary part. 



4- The in-in expectation value 



Now combine (149) and (152), the time dependent expectation value of an operator can be written as 

(01 \fe iS ^ Hl{T ' )dT ' 



(ft|Q(r)|ft) = 



Q'(r) 

Ml 



Te 



-irHi(r')dr' 



|Q) 



(153) 



Note that in the anti-time ordering [T • • • ], the ie prescription is actually with a different sign (i.e. r — > r(l + ie)) from 
the time ordering part [T ■ ■ •]. This is because the T part is the Hermitian conjugate of T part, which flips the sign 
of ie term. This contour of integration, specified by the above ie prescription, is known as the in-in contour (Fig. 13). 



Strictly speaking this requires all modes to be sub-Hubble. However, during inflation there are always Hubblc-crossing and super-Hubble 
modes. Also, the hierarchy between Hubble scale H and Planck scale M p may not be large enough to contain all those modes safely. 
There are a few assumptions here: 

• (fi|0) ^ 0. This may be fine considering Hj is a small perturbation. However things may get worse during a long time of interaction. 

• If (Q|0) grows with time, the growth near r — > — oo is slower than exponential. 

• It is good to interpret r as conformal time. Because if we interpreted the r here as proper time i, there is no limit t — > — oo. 
However, during inflation the energy scale H drops slowly. This indicates that Ht ~ const X t may be a better measure of time 
than t oc —e~ Ht . 

Unfortunately, we know very little about the interaction vacuum. Thus it is hard to explicitly verify those assumptions. 
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FIG. 13: The left panel is the contour of integration in the in-in formalism. In the right panel, the contour is deformed to a 
vertical line running top-down. The deformed contour is used numerically in [98, 99] and more generally in [1UUI HUfj . 



The denominator deserves a more close look. By taking Q{t) = (identity matrix), we can actually write |(0|£1)| as 



<0| 






Te -i/; Hi(r')dr'" 


10) 





(154) 



To get the RHS, we require both the states (0|f2) and (0|0) are normalized. The [T ■ ■ ■ ] part cancels the [T ■ ■ ■ ] part 
because it is just the Hermitian conjugate of a unitary operator. Thus finally we arrive at the master formula, known 
as the factorizcd form of in-in formalism: 



(n\Q(r)\n) = (o| 



fe lJ ^ Hl(T ' )dT 



Q J (r) 



Te Jt o 



|0) 



(155) 



Practically one expands the above formula order by order in perturbation theory. For example, the first order term 
in the expansion is 



(n|Q(r)|0)! = 21m / dri(0|Q'(T)#/(n)|0) , 

Jt 



(156) 



and the second order terms are 



(fi|Q(T)|n) a = f d n f dT 2 (0\H I (n)Q I (T)H I (r 2 )\0)-2Re f d n I ' dr^Q 1 ' {^Hj^H^Q) . (157) 

J Tq J To J To J To 

However, one should note that the order in Hj does not necessarily agree with the C(C") expansion. For example, 
when calculating the four-point function of general single field inflation, both the first order expansion of in-in 
formalism (with Hj being the 0(C ) vertex) and the second order expansion of in-in contributes, with comparable 
amount |102| . 



Before moving on, let us note that there are a few variants of the in-in formula ( 155 ): 



• The Wick-rotated factorized form 100, 10 lj : In the right panel of Fig. 13 if there are no poles or cuts between 
the dashed contour and the solid contour, both contours give the same result. Using the solid contour, the in-in 
formalism can be written into a more compact form: 



(0|Q(r)|fi) = (0|f 
= (Q\f 



Q\t) exp 
Q\t) exp 



zoo+r 



H I (T E )dT E 



— zoo+r 
oo 



|Q) 



Hi(t + iT E )dT E 



|0) , 



(158) 



where in the second line a further change of variable is applied to simplify the integral. The Wick-rotated 
factorized form makes the (linear) UV convergence of in-in integral more explicitly. 



• The commutator form [95j : Eq. (1551 can be rewritten as 

(fi| Q(r) |0) =5~> / d n f 1 dr 2 ■ ■ ■ P ' dr n ([ffj(r n ), [JT/(t„_i), • • • , [ff/(n), Q 1 (t)} ■ ■ ■ ]]) . (159) 

Q J To J To J To 



To see this, one first observe the n — order term agrees with (155). Then assuming the (n — l)th order term 
also agrees, the time derivative of the (n — l)th order terms generates the agreement of the nth order terms. 
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The advantage of ( 159 1 is that, there are some cancellations in the IR (i.e. r — > 0) from the commutators, which 
are not obvious in the form of ( 155 1. For example, the two terms in ( 158 ) could have cancellations between each 
other in the IR. 



However, the form (1591 has also a disadvantage: The it and — ie terms are mixed in (1591 thus we may meet 
divergences in the UV (i.e. r — !• — oo), due to the difference between |f2) and |0). 

Similarly, the commutator form can also be written as 



00 r-T r-T r-T 

(n\Q(r)\n) =J2 in dT i / dr 2 ' ' ' / dT n ([HxiTt), [Hj(t 2 ), ■■■ , [Hj(r n ), Q 1 (r)] •••]]) 

n= Q J To JTx JT n -i 



(160) 



The mixed form |98| : To keep both the advantage of ( |155 1 in the UV and (159) in the IR, one can choose an 
intermediate time r c (which is typically |fcr c | ~ 1) and re-group the terms before r c using the factorized form 
and the terms after r c using the commutator form as 



dTi (commutator form) 



dr i+l 



dr n (factorized form) 



(161) 



i=i " T " 

The result is independent on the intermediate time r c , and well behaved both in the UV and in the IR. 



B. 3rd order gravitational action 



We now use the in-in formalism to calculate the three point function for inflation. For the minimal model of 
inflation, the three point function is small. On the other hand it is dominated by pure gravitational actions, instead 
of V"'(<j>) from the explicit non-linearity in the inflationary potential. 

To calculate this, we need to derive the third order gravitational action. As an example, here we only do the 
calculation in £-gauge and using the ADM form of Ricci scalar. 

For simplicity, here we take the Planck unit: M p = 1. 



1. Constraints 



For calculating the three point function, we have to solve the constraint equations. Naively, we would expect that 
we have to solve the constraint equations up to third order in £. However, fortunately, we still only need to solve the 
constraints up to first order. 

In general, to derive the second order action, one needs to solve the first order constraint equations. To derive the 
nth order action (n > 3), one needs to solve the (n — 2)th order constraint equations |103j . 

To see this, recall that there are no time derivatives acting on constraint variables. 41 Also the spatial derivatives 
on constraint variables can be converted to momentum k by going into momentum space (to save writing, we shall 
not do it explicitly). Taking the lapse function for example, in general we have 



S 



d 4 x C(N) 



SS 







(162) 



Let us expand N = N a + Ni + ■ ■ ■ + N n + • • • , where N n denotes the 0(£") order terms. The constraint equation 
dC/dN — can be expanded (up to 0(C)) as 



dC 

dN 







(163) 



o,C° 



9£ 

dN 



iVi 



o.C 1 



d 2 c 

dN 2 



(164) 



o,c° 



if one used the 4-dimensional Ricci scalar for calculation, there exist time derivatives on constraint variables. But those time derivatives 
can be eliminated by integration by parts. 
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where the subscript 0, C a denotes the term does not contain N, and is ath order in £ (or the shift vector). 
Now consider the nth order action (n > 3). All the terms that contains N n are 



S n D / d 4 x N, 



dN 







(165) 



o,c° 



This term vanishes because of (1631. All the terms that contains N n _i are 



S n D / (Fx N n _ x 



d 2 C 



o.C 1 



dN 2 



o,C°, 



(166) 



Again this vanishes because of (164 1. However, from N n _ 2 on, the coefficients in the constraint equation no longer 
coincides with the coefficients in S n . The above proof also goes through for the shift vector Ni . Thus we have to solve 
the constraints to order 0(( n ~ 2 ). Now we want to calculate S3. Thus Eq. (81) is enough for solving the constraints 
(and hi = at this order). 



2. Action 

The second order action is already given in (79). The third order action takes the form |103j : 

/( 27 9 27 9 

dtd 3 x a 3 \ 3H 2 a 3 - eH 2 a 3 - 9H 2 a 2 ( + 2>eH 2 a 2 Q + —H 2 aC, 2 - ^eH 2 ^ 2 - -^H 2 ^ + ^eH 2 ^ - 6Ha 2 ( 
v £j £1 £1 £1 

+ l8Ha(( - 27#C 2 C + MO 2 - 9C(C) 2 - ^Hab^C + \llQ> % d£ - -^Had^d.C + ^H^d^ + -WfiiC 

a z a z a z a z a z 

Gj G> G> 0/ Of G> CL 

- ^a(d 3 bi) 2 - ^((djh) 2 - -^ad&dibi - ^(djhdibj + ^Ha 2 d 2 f3 - ^Ha^P + ^H( 2 8 2 f3 

_ l a (d 2 f3 + ^(Cd 2 (3 - ^bjditdidrf - ^djpdiCdtdrf - -^adjbididj/3 - \u),h,iU), 3 - -^a(d^-/?) 2 
a z a z a 4 a 4 J a 4 a 4 2a 4 

- i« + i»< a2 « 2 + i« aJ « 2 " - - \ Va ? - \ v ?) ■ < 167 » 

Note that we can set bi — 0. This action is simplified into 

/( 27 9 27 9 

dtd 3 x a 3 \ 3H 2 a 3 - eH 2 a 3 - 9H 2 a 2 ( + 3eH 2 a 2 ( + —H 2 a( 2 - -eH 2 a( 2 - —H 2 C, 3 + ^eH 2 ^ - 6Ha 2 ( 

+ l8Ha(( - 27HQ 2 C, + MO 2 - 9C(C) 2 - ^Hadtfdit + 4#CWC 

+ -^dipCdit - \a{d t () 2 - ^C(^C) 2 + ^Ha 2 d 2 p - ^HaCd 2 (3 + -^H( 2 d 2 /3 
a z or a z a z a z a z 

- ^a(d 2 f3 + ^(Cd 2 (3 - ^drfditdidjP - ^jaididrf) 2 

CL CL (X Zid 
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3. Integration by parts 



Inserting the constraint Eq. (81) into the third order action (168), and do a lot of integration by parts , we get 



5, = / dtd 



:{a 3 e 2 CC 2 + ae 2 ((d() 2 - 2aeC,dC,d X 



L la 4a 



2/(0 



sc 



c 



•}■ 



where \ is defined by 



d 2 X = a 2 e( , 



thus the first line of (169) is 0(e 2 ) in slow roll and the second line is C(e 3 ). In the third line, Sl ~ 



(169) 
(170) 

is proportional to 



the equation of motion, thus does not contribute to the three point function 43 . Just for keeping track, /(C) is 



/(C) = |c 2 + ^CC + 



-dCdc + d-^didjidiCdjO] , dcdx-d-^didjiditdjx)} 



4a 2 H 



2a 2 H 



(171) 



And Cb is a collection of boundary terms 44 

1 



Cb =d t 



iCiatdjCdidsx - d 2 (d 2 x ) 



^2 1 

2 S A 2aiJ 



t{d i d i xdid i x-d i xd l X 



1 



i 



4aff 3 



4aH 2 

+ ^(2Ca 2 c^c - KWd^ + diC^-C) + ^(CdjCW-H - WWH + CC^C - C 2 5,C) 

(172) 

The spatial part ■ •] do not contribute to the three point function. Because there is no spatial boundary in our 
inflationary universe. However, we have to be more careful about the time total derivative. This is because for in-in 
formalism, we do have a time boundary, typically chosen at a time where C becomes conserved after that. In the 
a — » oo limit, only one term in the dt[- ■ ■] survives 45 |104j : 



£ fc ^ t (-fa 3 C 2 c) 



(173) 



Doing the integration by parts is quite difficult. The readers can save some time by expanding the boundary terms in Eq. | |172| l, to keep 
track of what integrations by parts we have performed. 

Note that we are doing perturbation theory, thus it is allowed to use the linear equation of motion, when calculating the third or higher 
order actions. On the other hand, one could insist not to use the equation, but instead do field redefinition £ n = £ + C(f 2 ) to eliminate 
^ | . In this case, all the f terms can be eliminated without using equation of motion, and there is no boundary term contributing to 
the action, in terms of £ n . 

In calculation, one does not need to write down all of them explicitly. In this way it is clear to see which integrations by parts we have 
performed. On the other hand, for the purpose of calculation, only the boundary terms that both (a) contain j on the boundary (i.e. £ 
in the bulk), and (b) non- vanishing on the boundary, need to be considered. 

It appears that this boundary term contains £" when expanded in the bulk, and thus the variation principle is not well defined. However, 
actually there is no problem because we are allowed to use linear equations of motion in the bulk. If we choose not to use the linear 
equations of motion, but instead perform field redefinitions to get rid of high derivative terms in the 3rd order action, then this boundary 
term does not appear 11031 . 
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To conclude, the third order Lagrangian, which contributes to the three point function is 

C 3 =a 3 e 2 (C 2 + ae 2 C(d() 2 - 2aeCd(d X + d t (-2? a 3 ( 2 ( 

+ °^C 2 C + ~ d(d X d 2 X + ^d 2 C(d X ) 2 (174) 
2 la 4a 

where the first line dominates over the second line in slow roll approximation (note x is proportional to e). 

Before to proceed, it is important to note that, the dominate contribution for £ 3 come from gravitational interac- 
tions, instead of the inflaton's direct coupling. To see this, consider the inflaton's direct coupling V"'8<fi 3 . The third 
derivative V" can be estimated as 

V'~~ZH4>, d t {V') = V"4> = -ZH^xO{e) , d t (V") = V'"j> = -3H x 0(e 2 ) , (175) 

where 0(e) collectively denote order of slow roll parameters. Thus 

V'" = * x 0(e 2 ) ~ ^ffi . (176) 
6 



Gauge transform V'Scj) 3 into £-gauge, using 6<fi = —<j)£/H, we have the Lagrangian for inflaton self interaction in the 
C-gauge: 

C 3 D -Vm^C 3 = 0{e 2 ) x -^C 3 = 0{e 3 )C, 3 . (177) 



This belongs to the second line of (174), which is sub-dominated. In other words, gravity is not the weakest force 
during inflation. The inflaton self interaction is weaker (although it is not a gauge interaction, in which case gravity 
is conjectured to be the weakest force |1051 1106j 1. 



C. Prom action to Hamiltonian 

As in classical mechanics, a Legendre transform H = J d 3 x — Cj brings Lagrangian to Hamiltonian. Especially, 

potential energy V((f>) flips sign in the Legendre transform. 

For the gravitational action, the situation is a bit non-trivial because there is derivative coupling. Fortunately, the 
third order interaction Hamiltonian is still just flipping the sign of the third order action: 

H 3 = - J d 3 xC 3 . (178) 

We shall derive the above equation, together with a non-trivial relation for the 4th order relation between H4 and £4 
[107] . Readers who are not interested in this detail are suggested to skip the remainder of this subsection. 
We consider a Lagrangian up to 4th order in perturbations: 

+ hf b } cd a a a b a c a d + h^ c a a a b a c + h$a a a b + h^a a + / 4 > , (179) 

where a a are the dynamical fields where a runs over the number of those dynamical fields. The coefficients 
function of a a but do not contain a a . The n index in the parenthesis denotes the coeffi- 
cient is nth order in a. 

First one define the conjugate momentum 7r a : 

**=o&> " Fabnb ' (180) 

where F ab is the inverse of such that F ab f^J = £». 

Now a a can be solved in terms of n a order by order (up to 3rd order here) as 

• - T - - W 2) - >° fl £V - |*~s2>*v . (i8i) 



*(i)-y' "(2)-~2 r yb ~2 J 
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(0) „(l)_a^fc 



Now the Hamiltonian density is 



Updkpij (o; I (0) Q 6 c _ ipi^W i 3 F ifc ft (i; I 



,(0) ^ahi 
abjk 



(182) 



(183) 



where all the a a on the RHS are to be replaced by Eqs. ( 181 ), ( 182 1. Finally, define the interaction picture derivative 
as discussed in Sec. MI A 21 



7T 

T 



(184) 



Inserting this definition into the Hamiltonian, the Hamiltonian density becomes 



(185) 



We observe that at the third order, we indeed have H3 — —£3 even when there are derivative couplings. On the other 
hand, at the 4th order non-trivial terms arises in the last two lines. 



D. 3-point correlation function (bispectrum) 



Now we have derived the Hamiltonian, and are in the position to use in-in formalism to calculate the correlation 
function. We only do it at the leading order of slow roll. 

Inserting the mode function (851 together with the Hamiltonian (first line of (174 1 and Eq. ( 178[ )), and perform the 
time integration. The result is w |103j 



p2 

(C kl Ck 2 Ck 3 ) = (2^) 7 <5 3 (k! + k 2 + k 3 ) Hh/k 3 ,k 2 /k 3 ) , 

K x K 2 K 3 



(186) 



where there is a factor of Pj> on the RHS. Thus the three point function is also called the bispectrum. The dimensionless 
shape function 

w, , 3(77 - e) (fcg + fc| + fcg) 3e {k Y k\ + k x k\ + fcgfcf + k 2 kj + k 3 k\ + fcafcf) {k\k\ + k\k\ + fcffcf) 

J-(Ki/Ks,K2/K3) = — . . 1 ; -■ . . r £ 



3fcifc 2 fc 3 



6kik 2 k 3 



kik 2 k 3 (k 1 + k 2 + fc 3 ) 
(187) 



The shape functions are plotted in Fig 14 Note that when ki/ka 0, J- oc k^/ki. This k\/ks — > limit (or limits 
k 2 jk\ — > or kzjk\ — > 0) is called the squeezed limit. The squeezed limit is of particular importance in the study of 
non-Gaussianities. 



Here the slow roll approximation is assumed. Numerical code without slow roll approximation is available in 108, 109 . 
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(a) local shape 




4 k 2 /k 3 



(b) approximately local 



(c) approximately local 





FIG. 14: The panels (a), (b) and (c) are the shape functions 



(k^+k 2 + k^) (k 1 k 2 +k 1 kj+k 2 kf+k 2 k 3 + k 3 kf+k 3 k 2 ) (kjk 2 +kfk 3 + k 2 k 3 ) 

3kik 2 k 3 ' 6kik 2 k 3 k 1 k 2 k 3 (k 1 +k 2 +k 3 ) 

respectively. Note that only the configurations satisfying the triangle inequality ki + > k$ makes sense in physics. The 
part with fci + < k$ is set to zero by hand and should be ignored from physical consideration. The shape function in panel 
(a) is known as "local shape" , because local processes could only generate this shape. The physical meaning of local shape is 
discussed extensively in Sec. |III E| The other two panels look similarly to local shape. Especially all those 3 shapes have the 
same scaling when k\/k$ — > 0, that T oc kzjk\ in this limit. 



E. Local non-Gaussianity and the local /ml 

We explain in this subsection the physical meaning of local non-Gaussianity [110] , The local non-Gaussianity is 
originally introduced as a position space expansion of non-Gaussian perturbations around Gaussian perturbations 4748 : 

o n 07 Qi 94^ 

C(x) = C 9 (x) + 5 W 3 2 (x) + 25 W s 3 (x) + ^h NL C^) + ggg^LCKx) + ^'^(x) + • • • , (188) 

where Cg( x ) is a perturbation variable satisfying Gaussian statistics. Note that this is an ansatz, instead of a general 
expansion. Because in general, the non-Gaussian estimators /jvl> • ' " could depend on position, and terms like Cg( x ) 
is not necessarily expanded at the same spatial point. As a result, the terms like fNLCgi*) h as to be in general written 
as a convolution. 



The funny numerical coefficients come from that /jvl is originally defined in terms of the gravitational potential 

In Maldacena's paper, there was a minus sign in front of the /nl term. Here we use the WMAP convention. A summary of conventions 
can be found in [TTT] [112]. 
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The local ansatz has clear physical meaning. This ansatz assumes non-Gaussianities are generated independently 
at different spatial points. Translating this requirement into the context of inflation, typically this implies the non- 
Gaussianity is generated on super-Hubble scales. 49 

Especially, note that, for positive /jvx, the part of contribution from positive £ s gets enhanced, and the part of 
contribution from negative £ g gets suppressed. This provides an intuitive way to visualize the sign of $nl- 

In position space, this non-Gaussianity can be characterized by the probability distribution function of Q. The 
probability distribution is plotted in Fig. [15] Note that the tail at large and positive £ indicates where structures are 
formed in the universe. Thus positive f^L means more galaxies are formed given a fixed power spectrum. For the 
CMB sky, positive /nl means there are more very cold spots than very hot spots on the CMB, and more modestly 
hot spots than modestly cold spots 50 . 
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FIG. 15: Probability distribution of £ for different }nl parameters. Positive }nl corresponds to the red curve in the figure. 
Negative /jvl corresponds to Q — ¥ — C of the red curve. This figure is only for illustration purpose. The real world |/jvl| is 
much smaller than the one plotted here. 



The position space features of Jnl is intuitive to understand. However, the inflationary perturbation manifests 
itself in momentum space. There are much more configurations of three point function to look at, than just looking 
at position space probability distributions point by point. Thus the momentum space correlation functions provides 
a much more powerful test of non-Gaussianity than the isolated position space test. 

In momentum space, the corresponding expression is 



7 1 



NL 



d 3 q 
(2tt) 3 



C 9 C 9 



k ■ 



Inserting the expansion into the three point function, the three point function can be calculated as 
(C kl Ck 2 Ck 3 ) - (2^) 7 <5 3 (k 1 + k 2 + k 3 ) ^g, p d k i) p d k 2) + (k a «> k 3 ) + (k a ^ k 3 ) , 



(189) 



(190) 



Note the logic here: Local mechanisms (super-Hubble generation of non-Gaussianity) produces local shape non-Gaussianity. However, 
local shape non-Gaussianity may not necessarily be produced by local mechanisms. Nevertheless, counter examples are unknown to 
the best of my knowledge. On the other hand, super-Hubble conversion of non-Gaussianity does not necessarily produce local shape 
non-Gaussianity, as long as the non-Gaussianity is generated on sub-Hubble or Hubble scales 113 114 . 

On CMB cold spot means higher actual energy density. Because the CMB photon needs to climb up a bigger gravitational potential 
before reaching us. 



4G 



This non-Gaussianity has the same shape as the first term of (187). The other two terms in Eq. (187) has different 

ocal shape Jnl is (115] 

(191) 



shape, but behave quite similarly. The observational bound of 

f NL = 2.7 ± 5.8 (68% CL statistical) . 
The local ansatz ( |188[ ) can be generalized to multiple species: 

c(x) = wm*) + Iy1 %*( x )^( x ) + 1 %fc^i(x)^i(x)50 fc (x) + • ■ . 



(192) 



ijk 



Note that in this generalization, the locality of non-Gaussianity is not modified. Only that multi-source of fluctu- 
ations become possible. Here the notation Njj... and 8cj>i is used, having in mind that one can interpret Nij... as 
dN/ (d(j>id(j)j ■ ■ ■)■ In this case the expansion (192) is related to the 5iV-formalism. 
Inserting the expansion into correlation functions, at the three point level, we have 



NL 



(193) 



While for the 4-point function, we have 

(CkxCk.CksCkJ = (2^) 9 ^(k! + k 2 + k 3 



27_ ( P\ 
100 5JVi I k\k\k% 



3 perm 



-JNL 



Pi 



Z-i3 Z-i3 

fi, 1 fi, 2 /t, 1 



11 perm 



13 



(194) 



Because of the Pj? factor, the four point function is also known as the trispectrum. Now even for the local shape, 
there are two estimators qnl and tnl- In terms of the ansatz (192), we have 



9NL 



tnl 



Y. l]k ^N lk N 3 N k 



(195) 



Note that gNL depends on N^, which does not appear in the expression of three point functions. On the other hand, 
tnl is determined by 2Vy and Ni, the same coefficients that determines $nl- Thus it is interesting to see if there are 
relations between tnl and fNL- Interestingly, such relation indeed exist. Using the Cauchy-Schwarz inequality 



Y^{A l A l ){B ] B. j ) > ^2(AiBi)(AjBj) , 



(196) 



where Ai and Bj are real vectors, we have 
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tnl > t^Jnl 



(197) 



This relation is known as the Suyama-Yamaguchi relation [116] . Especially, when there is only one source of fluctuation, 
the above inequality takes the equal sign. Similar relations also exist for higher order correlation functions |117j or 
loop corrections |118j . 



F. Soft limits of correlation functions 



1. The squeezed limit 



Here we are to consider the physical meaning of the squeezed limit. When one of the momenta is much smaller 
than the other two, say, ki/k^ <C 1, the k\ mode is already super-Hubble and become classical and conserved when 
the modes — ^3 exit the horizon [103] . This constant mode can be considered as just a shift of background. 5152 
Thus the three point function should reduce to the background shift of a two point function. 



On the other hand, if Q is for some reason not conserved on super-Hubble scales, it cannot be considered as a shift of background. 
Because one can distinguish £ and a redefinition of the background scale factor by the time evolution of Examples of this type include 
multi-field inflation, and a growing mode of £, as discussed in Sees. |IVB| and [TV A 4| respectively. 

There could also be initial correlations between the three modes on sub-Hubble scales, which are stretched to super-Hubble scales as 
they were, and happen to provide non-vanishing squeezed limit. An example of this type is discussed in Sec. IIV A 31 
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To see this explicitly, one can expand (Ck 2 Ck 3 ) with the presence of a background ki mode |119| : 

Ck 2 Ck 3 = (Ck 2 Ck 3 )o + C kl ^r- (Ck 2 Ck 3 ) + • • • ■ (198) 



where <9/<9£ki can be written as a shift of background. Recall the large gauge transformation (70): via a A- 
transformation, we have 

Ck, - C kl + A( 27r) 3,3 (kl) , k _> ke -A , (2.)3.3 (kl) _|_ = ^ = = . (199) 

To get the final equation, consider a general function f(e~ x k). We have d\f = —e~ kf and dkf = e~ A /'. 
Thus we have 53 

C kl ^-(Ck 2 Ck 3 ) = - (27r)3 S (kl) ^(^Ck 3 ) • (200) 

Contract this term with (£ki • • • ) , we get 

p2 

lim (Ck.Ck.Ck3) = -(2^) 7 <5 3 (k 1 + k 2 + k 3 )-^3 (n s - 1) . (201) 

This consistency relation should be satisfied when ^ is indeed a shift of background, especially for the standard 
model of single field inflation. To check that, we note the shape function takes the limit 

lim F(k x /kz, k 2 /k 3 ) = ^-(2e + 77) , (202) 
fci/fe 3 ^o 4fci 

Thus 

fe JimjC kl Ck 2 Ck 3 ) = (2T) 7 5 3 (k 1 + k 2 + k 3 )^pf (2e + V ) . (203) 

Recall that the spectral index is n s — 1 = — 2e — r/. Thus the consistency relation is indeed satisfied for the three point 
function that we just reviewed. 



Comparing with the squeezed limit of Eq. ( 190 1 



lim (C kl Ck 2 Ck 3 ) = (27T) 7 ^ 3 (k 1 + k 2 + k 3 )|^Pc(fc 1 )P c (fc 2 ) , (204) 



we have for this minimal model 

fNL = ~(ji.-l). (205) 

The squeezed limit discussion of three point function can be straightforwardly generalized to higher point correla- 
tions. As a result, in the squeezed limit, we have 

dfNL , dg NL 

9NLCC-, h NL cc—, .... (206) 

Those consistency relations can also be checked by resuming the perturbation Hamiltonian with the presence of a 
classical mode of perturbation [120J, or a result of unitary [121] or operator produce expansion [79"] . 



53 The delta function on the denominator is not well denned. However, one can consider perturbations in a box and take the box size to 
infinity to cure this problem. 
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FIG. 16: Soft limit of correlation functions. In this example, we have ki + k2 + k3 — > 0. The left panel illustrates momentum 
conservation, and the right panel illustrates the soft limit is dominated by a propagation of a soft momentum. In this limit we 
get jnl oc JnlQnl • 



2. Soft limit of internal momenta 



Additional soft limits arises for n-point function with n > 3. For example, now we can have ki + k2 + • • • + k m — ► 
, (to <= n — 2) without any external momentum being zero (note the ordering of momentum is not important here. 
Thus this covers any partition of summation goes to zero). The situation is illustrated in Fig. 16 



Note that the exchanged scalar has a divergent behavior l/k A when k = ki +k2 



+ k m — > 0. Thus in this limit, 
this propagation should be the dominate diagram. The n-point correlation breaks into a m + 1-point correlation times 
a (n — to + l)-point correlation. Note that for to > 1 and n — m > 1 (which is the requirement that k is an internal 
momentum), the zeroth order expansion in (1981 does not vanish 54 . 

We again apply a similar argument as the squeezed limit: k is a shift of background, thus Cfc ~ (27r) 3 S 3 (k) . With 



this shift, the zeroth order expansion of (198) is not affected and thus 



lini(C kl Ck 2 • • - Ck„ 
k—>-0 



(CkiCks ' ' ' Ck m )(Ck m+ iCk m+2 ' ' ' CO 



[(27r) 3 <5 3 (k)] 2 

Especially, in case of four point correlation function, this relation produces 55 

x - 36 f 2 

~ 25 ' 



(207) 



(208) 



Once we relax the single degree of freedom assumption, there can be more than one mode propagating as the soft 
mode. Also the propagating mode could contribute a different shift of the background. In this case 



36 
25' 



tnl > t^Inl 



(209) 



This is again the Suyama-Yamaguchi relation. 



3. Usage of unitarity 

Here we derive the Suyama-Yamaguchi relation once more, by showing that the Suyama-Yamaguchi relation is 
actually a special example of optical theorem in cosmology [121] . The main purpose of this subsection is to illustrate 
the unitarity method, and how to cut cosmological Feynman diagrams in general, despite that the same relation has 
already been derived twice in this review 56 . 

In the in-in formalism, an expectation value can be calculated as 

(Q(t)) = (F*Qi(t)F) , (210) 



Even for m = 1, the zeroth order expansion may not vanish if there are tadpole contribution to the background. However, one can 
cancel this tadpole before doing the whole calculation. Or one can take the tadpole into account in the correlation function level, and 
the conclusion should not change. 

Note that for more general shape of non-Gaussianity, this relation is in general an integral relation in momentum space, which manifests 
itself better in position space 1 12X1 122| . 

I thank Miao Li, Chunshan Lin and Shinji Mukohyama for discussion for the contents in this section. 
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where F is a unitary operator defined by 

F = F(t,t )=Texp^-i £ Hi(t)dtj , (211) 

and Qi(t) is the corresponding operator in the interaction picture. 
To be explicit, we have 

(C kl Ck 2 Ck3> = (^Ck^k.Cks^) (212) 

and 

(C kl Ck 2 C qi Cq 2 ) = (^CkXk.Cq.Cq^) , (213) 

For the time being, we restrict our attention to the states that created by a single field operator. Because in the 
folded limit, a single internal propagator should be dominate considering that P(k) ~ 1/k 3 . Thus we consider the 
unitarity formula restricted to those states, and the limitations will be addressed in later sections. In the Heisenberg 
picture, the states are time independent and thus even in a FRW background we have 

-E/^«l = E/|^(f)l0Koi(f). 

where Up is the mode function of 4$ , defined as 

0W= u W a W + ( U »r( a W)t. (215) 
A one particle state (well before horizon crossing) is defined as 

%) |0> ' (216) 

This is because in the very early times 57 , the perturbation modes live in flat spacetime. 

In the interaction picture, the operators evolves with the free Hamiltonian. While the states evolves with the 
interaction Hamiltonian, with time evolution operator F as |0) — > F\Q). Thus we have 58 

^E/lfs^^lOWUa^ (217) 

Note that the RHS of this relation has time dependence. This relation holds at any time when field content of the 
model do not change, or change trivially. For example, a single field component decays into radiation can be considered 
as a trivial change, when the interested p is on super-Hubble scales. This is because on super-Hubble scales, a scalar 
field and a fluid share the same properties, and can be calculated in the same way in the in-in formalism. On the 
other hand, curvaton decay in the curvaton scenario cannot be considered as a trivial change of field content, if the 
curvaton is decayed into the same radiation as the inflaton decay product. This is because there is a reduction of low 
energy degrees of freedom in this case. 



One should also note that (217) itself is not the unitarity condition. Rather it is derived from the unitarity relation 



from sub-Hubble time scales. Thus a violation of (217) such as a non-trivial change of field content is nothing scared. 



It could be simply a violation of the derivation procedure of (217), instead of a violation of unitarity. For example, 
there could exist monopoles, bound states or other non-local objects in Minkowski space, with radius greater than 
the horizon size when putting the fields onto de Sitter background. Those states cannot exist any longer because of 
causal considerations, and thus may signal an apparent shortage of states in de Sitter. 



We shall eventually contract this integration momentum p with ki — k2. Thus here "early" means well before the horizon crossing of 
the comoving wave number ki — k2. 

Note that in general there is also |0}(0|, which contributes the sum. But here we are only interested in the connected components in the 
correlation function. Thus this term is ignored. 
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By inserting equation (217) into the calculation of in-in formalism, we have 

2 



tnl 



(M) = £ 



pM(M) 
"p(MT 



(218) 



where • • • denote the corrections that cannot be expressed by (217), (say, non-trivial change of field content, bound 
states, two particle states via loop diagram, and not-so-folded contributions. 



IV. INFLATION MODELS 



So far, we have almost restricted our attention to the minimal model of inflation. On the other hand, there are a 
vast number of inflation models around. Those models brings various novel predictions. Here we introduce some of 
those models. 



A. General single field inflation 

In the case of single field inflation, there are already a lot of possibilities for generalization. 



1. Generalizing the Lagrangian 

The dynamics of inflation is determined by its Lagrangian. Thus towards generalization of the minimal inflation 
model, it is natural to consider how to generalize its Lagrangian. Here we restrict our attention to the scalar sector 59 

An immediate question is if we can generalize the potential. The answer is almost negative, considering we have 
already considered the potential as a general function of <fi in the minimal model. 

However, this is not the end of the story. We have assumed V(<p) is single valued. One could consider multiple 
valued V ((/>), for example <j> moving on a monodromy potential. This opens up interesting possibilities such as large 
field inflation in string theory |123j . One could also consider features or small oscillations on the potential. The 
inflaton could either tunnel though the potential |124[ 1125] . or rolling but with slow roll conditions violated (see 



Sec. IV D) 



Now let us go beyond modifying potential. The task is to find a general scalar Lagrangian, which is well behaved 
theoretically. Especially the Lagrangian should be ghost free. Otherwise either the theory is not consistent, or we are 
not expanding around the correct vacuum 60 or imposing the correct boundary conditions 61 . 

The ghost free requirement can be expressed in more explicit terms. Consider we have some background, and want 
to quantize small perturbations on this background: 

First of all, we would like to restrict our attention to the theories with correct sign of the kinetic term: S = 

J d i x(+8(j) + • • • ) instead of S = J d A x(—54> +■••). Ghost is not welcome to physics [130J. To see this, note that 
the propagator is the inverse of the second order action. Thus a wrong sign kinetic term results in a wrong sign of 
the propagator 62 

(219) 



p 2 — m 2 p 2 — m 2 + it p 2 — m 2 — it 



We yet have the freedom to choose the sign of it term for the ghost. However, if we choose the plus sign, the imaginary 



part of (219) has different sign from normal particle. Thus the optical theorem is violated and there is no probabilistic 
interpretation of the theory. On the other hand, if the minus sign is chosen, negative energy modes propagate forward 
in time. Thus the ghost particle carry negative energy. In this case the vacuum decays into ghost particles even 



One may use higher spin fields or modified gravity as inflaton as well. We shall not step into this huge literature in this review. 

In this case, a ghost condensate 1126. 11271 may drive us to the correct vacuum, as in the case of tachyon condensate. 

We typically impose simplest boundary conditions such that the field approach its classical value and quantum fluctuation vanishes 

at infinity. However, if alternative boundary conditions are imposed, such as that imposed in PT-symmetric quantum mechanics, the 

ghost may be eliminated |128l 11291 . 

Here we work in flat space. Because in the UV, cosmological background returns to flat space. And the ghost is already troublesome 
enough in the UV. 
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if ghost and other particles only interact gravitationally. Even worse, the decay rate for vacuum to decay into two 
photon and two ghost particles is 



M4 



(220) 



where A is the UV cutoff of Lorentz violation. Thus in a Lorentz invariant theory, the decay rate is divergent. The 
divergence is physical: in each frame there is a non-vanishing decay rate and one has to integrate over the infinite 
volume of Lorentz group. Any small decay rate is forbidden. 

Secondly, we require the equation of motion for perturbations to be second order in time derivatives. This is 
because, in a higher order derivative theory (for example, 4th order), the propagator can be written as 



1 



(p 2 — m\){p 2 — m 2 ,) to 2 — to 2 . 



p 2 — TO 2 



p 2 — m 2 



(221) 



Thus one of the two terms behaves as a ghost 63 . 

We would like to emphasize that the above requirements are imposed on the perturbations. The background could 
still have higher orders derivatives as long as perturbations are well behaved. 

The minimal inflation model is definitely ghost free. However, it is not the most general ghost free theory. Instead, 
the most general theory of a scalar, coupled to gravity and with second order equation of motion for perturbations, 
is [TMfI55] 



C =P(0, X) - G 3 (0, X)U4> + G 4 (0, X)R + G A .x [(n^) 2 - (V M V„0) 2 ] 

+ G 5 (4>, A)G^V"V> - ^G 5 , x [(D^) 3 - 3D<HV M V^) 2 + 2(V AI V^) 3 ] 



(222) 



where P, G3,G4,G5 are arbitrary functions of 4> and X = —(d<fr) 2 /2, and G± t x and G$,x denote derivatives with 
respect to X. For perturbations, the high order time derivatives are removed by integration by parts. One still need 
to be careful about the correct sign of the kinetic term, which cuts about half of the parameter space. The linear 
perturbations are considered in [136] and the non-Gaussianities are calculated in |137j . 

Historically, the P{<j>, X) term in the ( 222 ) is considered as K-inflation |138[ I139|. Later the G3 term is added 



inspired by the Galilean symmetry [140, 141 . And finally the most general form (222) is rediscovered in the context 
of cosmology. 

The full theory of (222) is complicated and we are not going into the details here. However, it is interesting to 
consider the simplest non- minimal case of the theory, where C — P((j),X), and all other terms vanish [11381 1139] I142[ 
143 . In this case, by varying the action with respect to the metric, the energy density takes the form 



p = 2XP X - P . 



(223) 



And one can find the pressure is p = P((j),X). The Friedmann equation and the continuity equation still hold: 

3M 2 H 2 = p , p + 3H(p + p) = . (224) 
For the theory of perturbations, we work in the £-gauge. The second order action can be derived as 



dtd^x 



where 



c 2 - 



P 



X 



r C 2 - ae(d(f 



Px 

I- 2XP XX 



(225) 



(226) 



Note that in the minimal inflation scenario, where P(X) = X and c 2 = 1. However here this parameter is generalized, 
and can be understood as the sound speed of the perturbations. This is because now the temporal and spatial 
directions have different weight in the action. 



63 It may be fine if the Lagrangian has infinite orders of time derivatives which resums to a ghost-free form I131H133) . 
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To solve the linear perturbation equations, one typically assumes slow variation of c s : s = c s /(Hc s ) <C 1. In this 
case, the £ field can be quantized and solved as 



Ck 



H 



fc"-k i 



Uk 



Mpy/^ecsk 3 



(1 + ikc s r)e 



(227) 



Note again that now the oscillation factor in the time direction is e~ lkCsT . On the other hand, the spatial Fourier 
transform still has the same factor e lk x . Thus a propagating wave described by this Lagrangian has a modified speed. 
The power spectrum can be calculated as 



H 2 



8n 2 M 2 c s e 



(228) 



The only difference from the minimal inflation model is the factor of c s . When c~ < 1, this is an enhancement factor. 
The quantum fluctuations are larger because the perturbations are frozen at a smaller horizon. 
For non-Gaussianities, the leading order terms in the slow roll approximation is 



did x a 



E l 



H 3 



3 4(i-c 2 s )CC 2 



r(l-aC(3C) 2 



(229) 



where 



E = XP X + 2X 2 P XX 



H 2 e 

n2 



X = X 2 P 



XX 



-X Pxxx ■ 



(230) 



Using some integration by parts and making use of the equation of motion, |229|) can further be simplified into 

A 



<>4 



S 3 = 



dtd 3 x a 3 



-2 — C + 



2 H 3 



Using the in-in formalism, the bispectrum can be calculated as 



(Ck 1 Ck 2 Ck 3 ) = (2^) 7 ^(k 1 + k 2 + k 3 



h 2 h 2 h 2 
" , l /i 2"'3 



J r (ki/k 3 ,k 2 /k 3 ) , 



where 
T = 



1 . 2A 



- 1 



3k 1 k 2 k 3 



2(fc 1 + fc 2 + fc 3 ) 3 

12 13 2 3 



1.2 u3 , dp , pp 
12 13 2 3 



p{.3 , 1.21,3 i 1.21,3 
2 1 — 3 1 3 2 



kf 



kr. 



k 3 



kik 2 k 3 {ki 



2fcifc 2 A:3(fci + k 2 +k 3 ) 2 



8kik 2 k 3 



(231) 



(232) 



(233) 



In the squeezed limit k\jk 2 — > 0, it is clear that the first term vanishes proportional to k\jk 2 . The second term 
appears divergent. However, both the divergence and the constant term near k\jk 2 — > actually cancel. To see this, 
consider the limit k\ — > with k 3 — > k 2 + ak\. In this limit, 



T 



1 



f 



2A \ 3fci 
2fe 



1 



1 



(5a 2 - ll)fci 
16fe 



0{k\) . 



(234) 



Thus J- oc k±/k 2 near k\jk 2 — > 0, where the linear term depends on how the squeezed limit is taken. The shape 
functions are plotted in Fig. |17| The left and right panels are respectively the normalized shape functions 



TJk\k 2 k 3 



24 

y 



pp , pp 

12 13 



k\k 2 k 3 (ki + k 2 + fc 3 ) 



pp pp , pp 

2 3 1 2 1 3 



pp _i_ pp i pp i pp p i p _i_ p 
2 3 2 1 — ' 3 1 — ' 3 2 1 2 — ^ 3 



2kik 2 k 3 (ki 



8kik 2 k 3 



(235) 



This same form of Lagrangian can also be obtained by starting from the 50-gauge and do a gauge transformation to the f-gauge. To 
get those leading order terms, the gravitational interaction in the 50-gauge can be ignored. And the gauge transformation only needs 
to be expanded into the leading order S(f> = —<j>£/H. Thus the calculation is much simplified in the <5</)-gauge plus gauge transformation 
way. 
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Those shapes are known as equilateral shapes. Note that those shape functions are not separable. In other words, 
the shape functions cannot be written in the form J^L (kfi (ki) fi 2 \k 2 ) fi 3 \k 3 ) . Technically it is much harder to do 
data analysis of non-separable shapes than separable ones. For this purpose, some shape templates are proposed to 
mimic the equilateral shapes |144j : The equilateral template 

Te = 6 (-2 - A- - A- - A- + h/k 2 + h/k 3 + k 2 /h + k 2 /k 3 + k 3 /h + k 3 /k 2 ) , (236) 



k 2 k 3 k : k 3 k 2 k x 

and the orthogonal template 

/ OI.2 oj.2 oi,2 

.Fo = 6 -8 - — ^ - — 2 --— z -+ 3fc!/fc 2 + 3fc!/fc 3 + 3fc 2 /fe! + 3k 2 /k 3 + 3k 3 /h + 3k 3 /k 2 . (237) 
V fc 2«3 kik 3 k 2 ki 



Those templates are plotted in Fig. 18 The coefficients multiplying those shapes are constrainted as |115j 



^equil = _ 42 ± ?5 ^ ^ortho = _ 25 ± 39 ( 68% CL statistical ) . (238) 
(a) equilateral shape 1 (b) equilateral shape 2 





FIG. 18: Left panel: the equilateral template -F e /6. Right panel: the orthogonal template -F /6. 



The trispectrum of this P(X,<fi) theory can also be calculated using the in-in formalism |102l 1145] , It is useful to 
note that non-trivial transformation ( 185 ) between the Lagrangian to the Hamiltonian has to be considered. Also the 
scalar propagation diagram (like the right panel of Fig. 16 ) needs to be included. 
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2. Effective field theory of perturbations 

In this subsection, we take a different point of view for extending the cosmic perturbation theory. Unlike the previous 
section, here we assume an inflationary background, and see what is the general perturbation theory living on that 
background. This approach is known as the effective field theory (EFT) approach for inflationary perturbations [146] 
(see also, HUHHS]). 

As in the general discussion of EFT, any operator respecting the symmetry of the theory could appear in the 
Lagrangian. The relative importance of those operators are determined by the dimension of the operator. Operators 
with higher dimensions are suppressed more by the UV cutoff scale of the EFT. 

Now we are to write down all possible terms for perturbations. Considering that there are many possibilities, it is 
helpful to choose a gauge to reduce the number of terms. The gauge to choose here is the C-gauge, where S<fi = 0. 
In this gauge, the most general Lagrangian for single field inflation, with the foliation-preserving diff-invariance 
x 1 — >• x 1 + x) is 



S 



\m 2 p r 



<t)g 



mi 



A(t) + -M*(t)(g 00 + lf + -Ml(t)(g 



Mf(t) i 



(g 00 + i)SK£ - 



Mi(t) 



00 



I) 1 



(239) 



where SK^ = Kn V — a 2 Hh^. The dots stand for higher order terms (including higher order in perturbation variable 

that can be written down straightforwardly if needed. It can be shown that other 
ftvptri etc can be rewritten in terms of the above operators, up to some total derivatives. 



C and higher order in derivatives 
operators as R^upa, R 00 , ^ 3 'R, 
Expanding around the FRW background, one finds 



c(t) 



-MpH 



A(f) = M 2 (3H 2 + H) 



(240) 



The form of ( 239 ) is not yet very useful. This is because we have to solve all the complicated gravitational interactions, 



which is possible but messy. Fortunately, there are a hierarchy of energy scales in the theory, among which the Planck 
mass is the highest. This implies the existence of a range of energy where gravitational interactions decouple. 

To find this range of energy, together with the simplified action with decoupled gravity, we should go away from the 
C-gauge. As we discussed in previous sections, there are a lot of cancellations in the C-gauge, which makes decoupling 
not transparent at all. 

The C-gauge is chosen by picking up a specific time coordinate. Thus the C-gauge can be undone by promoting the 
C-gauge time to a scalar field T(t, x), such that in the C-gauge, T^(t^,x) = (where the subscript C denotes quantity 
in the C-gauge). Now consider a general gauge 



t = t 



C 



?r(t,x) 



T(t, x) transforms as a scale field thus 

T(t,x) = T c (i c ,x) =T c (t + 7r(<,x),x) = t + tt . 
Now we are ready to make the C-gauge quantities into tensors from T(^,x): 



(241) 
(242) 
(243) 



S°9' 



(244) 



7°° = «<T 



6° 



51 + 



y/-g 00 ~ 2g°»d ll TT - g^d^d^ 



(245) 
(246) 



where the left column are unitary gauge quantities and the right column are quantity without choosing a gauge. By 
doing the above replacements to the action (239), together with the replacement (241 1, we return from the C-gauge 
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to a general coordinate system without choosing a gauge: 

1 



S 



(Fa* 



M, 



/2 



MpH(t + ir) [(1 + ir) 2 g m + 2(1 + n)d^g° l + g'i)-<),- 



M$(t 



+ 



Ml{t 



[(1 + ir) 2 g m + 2(1 + n)d t irg 0x + g '<>-(),- + if 
[(1 + 7r) 2 g 00 + 2(1 + TT^ng 01 + g^d^d^ + l\ 



}■ 



(247) 



where we neglect more terms in • • • for simplicity. 

After re-obtaining the gauge invariant action, we are free to choose the (5</>-gauge (or any other gauge close to the 
(50-gauge, like the Newtonian gauge). The advantage of (5</>-gauge, or any gauge which is close to that, is that near 
the horizon crossing, gravity is weakly coupled and only provide slow roll suppressed corrections. An example of this 
statement can be found in Eq. ( 77 ) . More generally, we can compare the terms with and without dynamical gravity 



in the new action, and conclude: The decoupling energy range for matter perturbations from gravity is 65 

k 



HM p 1 M p M p 



(248) 



where the dots denotes other scales (not shown in (247)) that could appear, which are also suppressed by M p . Note 
that (j)/M p ~ y/eH , and <f>/(HM p ) ~ iJfjH. This result is intuitive to understand: when all scales appearing in the 
theory are smaller than M p , gravity should be weak. 

However, when k/a < w mrx , things are different. Because small physical wave number means large length scales. 
And gravity accumulates on large length scales. Huge lump of matter makes gravity coupling non-neglectable. As an 
explicit example, the C(e 2 ) and 0{erf) terms in the gravitational action ( 169 1 produce non-Gaussianities Jnl ~ C(e, rf) 
on energy scales k/a < w m i x 66 . 

Nevertheless, if we are interested in physics of k/a 3> w m ix, or non-Gaussianity with f^L 3> &(e,v)i ^ is safe to 
use the action with gravity (including the constraints in the gravity sector) decoupled: 



S 



d 4 x. 



-MpR 



2M, 4 



. (din) 

7T K~ 



(249) 



Eq. ( 249 1 recovers the action ( 229 1 , without referring to a background model explicitly. Moreover, in the • • • terms 



we neglected here (but can be recovered straightforwardly), a much wider class of models are covered, for example, 
ghost inflation [HH [TSD1 [T3T] . 

By construction, EFT requires in principle a UV completion. The ultimate UV completion of inflation is not yet 
known. But it is interesting to see in toy models how UV degrees of freedoms are integrated out to get the EFT in 
cosmology. For example, EFT operators arise when a very massive scalar is integrated out [152[ 1153] . 

Although we have put the EFT approach here as a generalization of single field inflation, the usage of EFT is not 
limited to single field (or field theory at all). Instead, the EFT approach can be used widely in cosmology, including 
applications in multi- field inflation |154j , quasi-single field inflation |155[ I156j and large scale structures [157] . 



3. Generalizing the initial condition 



In the previous subsections we have discussed how the operators of inflation could be generalized, on the full and 
perturbation level respectively. However, physical observables not only depend on operators but also depend on states. 



Note that the w m i x should be understood as a wave number instead of an energy scale. Because we are comparing w m i x with k/a. On 
super-Hubble scales there are a huge number of particles in perturbation mode k thus the corresponding actual energy is much greater 
than u; m i x . 

The relation between local /jvl and w m i x can be estimated as follows: EFT approach (with decoupled gravity) predicts vanishing 
squeezed limit. And on super-Hubble scales, the time dependence of £ decays as f oc [k/ (aH)] 2 . Thus the gravitational non-Gaussianity, 
coming from k/a < w m i x , should be at most of order /jvl oc (^ m i x /i?) 2 oc max(e, n). And we have checked the local /jvx is indeed of 
this order. 
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We have argued that it is natural to choose the initially lowest energy state as the vacuum state. But yet there can 
be generalizations. 



The simplest generalization follows from Eq. ( 33 ) . Written in terms of the mode function of comoving curvature 
perturbation £, we have [143] 



H 



c£ +) (1 + ikr)e- lkT + C { -\\ - ikr)e l 



(250) 



One can turn on the c\. ' component, with |C^ + ' ) | 2 — \Cy, '\ 2 = 1. In this case, we have for the power spectrum 



Pt 



H 2 



8n 2 eM 2 



(251) 



In case C { k ±] have some phase-sensitive fc-dependence, there could be small oscillations on top of the nearly scale 
invariant power spectrum. 

Things become more interesting at the level of non-Gaussianity. We restrict our attention to a small non-BD 
component | c£ ^ | <C |Cfc |- Including a factor of C^/C^ effectively flips the sign of k (sign of amplitude, not the 
vector direction of k) . Recall that we had a shape function 



1.2 1,2 iUXU 

kik 2 k 3 (k 1 + k 2 



k 2 h 2 



k 3 ) 



in Eq. (187 1 of the minimal inflation model. With sign flip of the fcj's 67 , we get 

1 



Re[Cl 



(_). fcj k 2 + ^1^3 + k 2 k 3 



1 



1 



-fci + k 2 + k 3 ki — k 2 



ki + k 2 - k 3 



(252) 



(253) 



This shape function blows up at the folded triangle limit k 2 + fc 3 — ¥ fc l7 or its permutations. This folded shape is 
plotted in Fig. 19 In the squeezed limit k\jk 2 — > 0, k 3 = ak 2 , this shape function behaves as 



Re[C£ 



(-)i k l 



>k 2 1 



(254) 



which blows up dramatically - More quickly than the local shape, and depends on how the limit is taken. 

However, we need to be careful in interpreting the divergence at the folded limit. This is because this divergence 
originates in the UV. On the one hand, it is tricky to define the UV limit of a non-vacuum state. On the other hand, 
the ie prescription of selecting the interacting vacuum no longer works for an excited state. One may put a cutoff 
by hand to cure those problems phenomenologically [143] . However, the prediction is sensitive to the detailed cutoff 
scheme and thus sensitive to the UV physics [158] . An explicit UV completion of physics above the cutoff can be 
found in |159j . 

folded shape 




4 k 2 /k 3 



FIG. 19: A plot of the shape function ( 253 1 



Note that only the k in the (1 + ikr)e lkT part of the mode function is flipped. The k3 part of the mode function docs not change 
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4- Generalizing the attractor behavior 



Inflation is attractive as an attractor solution. And we have implicitly used this nature in various places. In case 
that during a period of time the attractor behavior is violated, the inflationary perturbations could behave differently 

For example, typically we consider an inflaton field rolling down its potential. However, what happens when the 
inflaton use its kinetic energy to climb up a potential? 

Note that the kinetic energy of the inflaton itself has equation of state pkin = Pkin- Thus even there is no potential, 
the kinetic energy dilutes as p^in a~ e . With a potential, the kinetic energy could only dilute faster. Thus we get 
r] < —6 during the climbing up stage of the potential. 

Recall the equation of motion for the comoving curvature perturbation: 

k 2 

( + H(3 + ri)(+ -5-C = . (255) 

When rj < 3, C effectively live in a contracting universe (though the actual universe is expanding). As a result, the 
conventionally decaying mode of £ becomes a growing mode and dominates over the constant mode. One can show 
that when r\ ~ —6, the power spectrum is still scale invariant. However the non-Gaussianity becomes /jyx ~ 5/2, 
which is not only order one but also breaks the consistency relation of single field inflation [1601 11(51] . One may 
further enhance this local non-Gaussianity by considering a small sound speed |158 . Qualitatively, the behavior of 
the perturbations are similar to the case of bouncing cosmology [162j . 



B. Multi-field inflation 



Here we generalize inflation towards another direction: including more light fields. 



1. Adiabatic and entropy perturbations 

The status of perfect fluid can be described by its energy density p(t, x), pressure pit, x) and velocity u l (t, x). On 
super-Hubble scales v z (t,x) decays exponentially thus the status of the perfect fluid is determined by pit, x) and 
p(t,x). 

Further, if the fluid has only one component, p is determined by p as p = p{p). In this case, we denote the fluctuation 
of the fluid as adiabatic perturbation. If it is not the case, we can parameterize p as 

P = p{p,Si). (256) 

Recall that when there is no entropy perturbation, the comoving curvature perturbation is conserved, following 
Eq. (97), on uniform energy density slice p = p(t): 

H(t) + ((t, x ) = -\— v ^l . (257) 

3 p[t) +p{p,Si) 

Thus we can define the non-adiabatic part of pressure according to the derivation from this equation. 
Consider a general situation without choosing a gauge. We note: 

p = Po (t) + Sp(t, x), 8p(t, x) = ^5p + |£ (sat - -Sp) +■■■ , (258) 

Po osi \ p J 

where • • • denotes higher order terms in perturbations. 

It is the second term which make C change on super-Hubble scales. Thus we can define the non-adiabatic part of 
p(p,Si) as |163j 

tfPnad = ( Ss i ~ ^ S p) H =Sp- ^Sp . (259) 

os, V P J Po 

This non-adiabatic part, or entropy part of pressure has clear physical meaning: It is the difference of equation of 
state, between perturbing the fluid and evolving the fluid in time. When there is such a difference, perturbation in 
the universe cannot be considered as a pure time shift between different local universes. 
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Now we identify the adiabatic and entropy perturbations for inflation. In case there are multiple rolling 68 fields 
i). We can rotate the basis in the field space of those fields, such that there is only one rolling field 



[1641 1165] . This can be done by considering basis vector 



if 



(260) 



where superscript (0) denotes the background level of the field, without perturbations. And the other (n — 1) basis 
vector are defined as the orthogonal directions. The rotated fields, including the background and the perturbations, 
are defined as 

Xa = eifa . (261) 

After this change of basis, the adiabatic direction, or called the inflation direction, is xi- All other directions X2, ■ • • Xn 
has Xi — 0. In words, those fields in the new basis then have flat potential thus not rolling. Those non-rolling fields does 
not contribute energy density or pressure at leading order in field perturbations. Thus at leading order perturbation 
theory, the fluctuation of those field are entropy perturbations. This decomposition is illustrated in Fig. [20] 




FIG. 20: Redefining the field directions for multi-field inflation. The \i direction is the inflationary direction and the 
perturbation on xi direction is the adiabatic perturbation. The other directions are on the hypersurface perpendicular to xi- 
Fluctuations on those directions are entropy perturbations. 



When the inflation trajectory is straight in field space, the basis vectors e l a are time independent and the rotation 
is trivial. Otherwise the basis vectors e % a are time dependent. 

For models with standard kinetic term, and at linear order in perturbations, xi is coupled to Xii' ' ' iXn direction 
only through a rotation of basis. In the case of straight trajectory those directions decouple. This is because vanishing 
Xi-, • • • and dV/dxi, • • • forbid direct coupling. 

On super-Hubble scales, a time dependence of C can be generated by this turning of trajectory: 

C = ~*JW - -^ei<tyj . (262) 



P + P 



Xi 



Originally very massive fields are not rolling, and remain not rolling after this rotate of basis. 
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Thus the adiabatic perturbation is sourced by the entropy perturbation along the e\ direction, 
direction is perpendicular to e\. It is sometimes convenient [166] to take this direction as e|: 



Note that the e\ 



£7 



(263) 



In this case the e| direction is the source of time dependent £ on super-Hubble scales. 

Note that those new add-ons of curvature perturbation come from different source from the quantum fluctuation of 
the original inflaton direction. Thus the power spectrum tends to increase instead of decrease, with the presence 
of turning of trajectory. It is in general difficult to suppress the adiabatic perturbation, once it is generated [167H169] . 

Before going into detailed models, a few comments are in order here: 

• The adiabatic/entropy perturbations are also called curvature/isocurvature perturbations. 

• Note that we have defined two versions for adiabatic and entropy perturbations: one for fluid and the other 
for field from rotating basis. They coincide at first order in perturbation theory, and super-Hubble scales. At 
second order perturbation theory, the classification of adiabatic/isocurvature perturbation becomes not unique. 
One can consistently use either of them. Also on sub-Hubble scales, even a single field has non- vanishing Sp na d- 

• Sometimes (e.g. the curvaton scenario), the fields which are rolling much more slowly than the inflaton (though 
still rolling thus unlike the non- rolling X2, ■ • • fields) are also considered isocurvature directions and generating 
entropy perturbations. 

• We have shown that at linear order, entropy perturbation can convert to the comoving curvature perturbation 
only if the inflationary trajectory turns. However, this is under the assumption of standard kinetic term for 
both fields. Otherwise, kinetic coupling can couple between adiabatic and entropy perturbations without any 
turning of trajectory. 

• At non-linear order, there can be coupling of adiabatic and entropy fluctuations (when defined as in the field 
theory way as Fig. 20 ) even without turning of trajectory. This is because the adiabatic direction is defined at 



the background level. At the perturbation level, the real field trajectory is perturbed slightly. 

In the next a few sections, we are to discuss a few models where the entropy perturbation converts to adiabatic 
perturbation. A summary of those models in light of the Planck data can be found in |170| . 



2. Curvaton mechanism 



In the curvaton mechanism |171H175"] . it is assumed that there is a curvaton field <r, which rolls much more slowly 
than the inflaton during inflation. After inflation, the inflaton decays first. Later the curvaton energy density 



catches up. When the curvaton energy density becomes 
perturbation. This is illustrated in Fig. 21 
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FIG. 21: The curvaton mechanism. 
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For simplicity here we review the simplest scenario of curvaton: the inflaton decays to radiation. And the curvaton 
oscillate around the minima of its potential V(a) = ^m 2 cr 2 before curvaton decay, which happens later than the 
inflaton decay. We use the method developed in [176] . 

Recall the discussion in Sec. |II G| there are separately conserved quantities for non-interacting field species. The 
conserved quantity associated to the curvaton field is 



P<r(t,x) 



dpo 

Pa 



(264) 



where we have used p a = for a fast oscillating field. Here and later in this subsection, we shall omit the superscript 
(gi) on £ CT for simplicity. 

This conserved quantity provides connection between initial condition and observations. Thus we have to: (a) relate 
the conserved quantity with the horizon crossing value of the curvaton field cr*; and (b) relate the conserved quantity 
with the curvature perturbation (. 

To relate ( a with <r*, we note that on spatial flat slice (SN = 0), Eq. (264) can be integrated as 

Pa 

The curvaton energy density is 

1 



e Ka Pa 



(265) 



Pa 



2 2 

-m a 



(266) 



where a is the amplitude of curvaton oscillation. 

However, the quantum initial condition for a perturbations are determined at Hubble-crossing of the corresponding 
perturbation modes. Thus we have to relate a at the oscillation stage to the value at Hubble crossing 69 : rr* = <j\k =a H- 
At Hubble crossing, the curvaton is a spectator field, thus 



where Sa* is Gaussian and has power spectrum 



Hi 
4tt 2 



The relation between a and er* is model dependent. In general, we can write this dependence as 

a = g(a*). 



Combining ( [265] ), ( p66j ), ( p67| and p69j ), we have 70 

1 



r 



fj 



E 



(&T*)™ 

n! 



(267) 



(268) 



(269) 



(270) 



where g = g(a*) and g^ = d n g{a)/da n \ a= ^. Eq. p70} is a nonlinear relation between the a-comoving curvature 
perturbation and the Gaussian fluctuation 6a* . 

We need yet a relation between £ and Ca- To derive this, we make use of ( 101 1 and its counter part for radiation, 
on uniform total energy density slice. As a result, 



Cr = C + ^ In 



Pr = Pr-e 



4(Cr-0 



(271) 



Note that, here we are aiming at a full non-linear analysis of perturbations. Thus rigorously speaking the perturbation is not a single 
k mode and k = aH is not well defined. However, observationally we are only interested in a small range of k. Considering the a 
perturbation is nearly scale invariant, the horizon-crossing value do not differ very much. 

Note here po- is no longer the average energy density because there is also a contribution from \'m? (<5cr) 2 . However, it is still consistent 
and convenient to define the background as 5a = instead of Sp rT =0 . 



3 \Po 



Thus we have, on uniform total energy density slice 71 , 

Pr + Pa = Pr + Pa = p r e MCr ~° + ^e 3 ^ -0 



Gl 



(272) 



(273) 



In case the final adiabatic perturbation is dominated by the curvaton perturbation, we have £ r = 0. In this case, the 
above equation reduces to 



(1 - f2)e- 4c + Oe 3 ^ = 1 



(274) 



where 



n = 



Pa + Pr 



(275) 



Thus finally one can solve £ from Eqs. (270) and (275) order by order in terms of the Gaussian perturbation 8a* 
As a result, 



P c = r 



g'H* 

6irg 



(276) 
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where we have omitted the bar on g, and 
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10r + 3r 2 
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(279) 



with p a and p r evaluated at curvaton decay time. 

Note that in the r <C 1 limit, /nl 3> 1. It is intuitive to understand why there is large non-Gaussianity in the 
curvaton scenario: when r <C 1, the curvaton is sub-dominate. Thus it has to fluctuate harder to have the correct P^. 
The harder curvaton fluctuates, the more non-Gaussian it is. Also, the inflationary slow roll parameter e doesn't enter 
this later time calculation thus /nl is not e-suppressed. This is because the curvaton potential near the curvaton 
oscillation is not slow roll suppressed. 

The other trispectrum estimator is trivial, considering the single-source nature of curvaton mechanism: 



36 
25* 



tnl — t^Inl 



(280) 



In the simplest case, if the curvaton has only quadratic potential, there is no nonlinear evolution on super-Hubble 
scales. Thus in this case g" = and g'" — 0. The above formulas simplify considerably in this limit. Otherwise the 
dynamics can be solved perturbatively or numerically in g [177H179] . 



3. Non-trivial reheating surface in field space 



Another way to convert entropy perturbation to adiabatic perturbation is to make use of a non-trivial reheating 
surface. In this section we consider a non-dynamical reheating surface but dynamical trajectory due to entropy 
perturbations. A dynamical reheating surface is discussed in the next subsection. 



Here we assume curvaton decays on uniform total energy density slice. Otherwise, there is additional slice matching 113 . 
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reheating surface 




FIG. 22: Non-Gaussianity from a non-trivial reheating surface in field space. When the reheating surface is not perpendicular 
to the inflationary trajectory, entropy fluctuation converts into e-folding number difference, and thus adiabatic perturbation. 



Conventionally, we assume inflation terminates when the inflation direction <j> (or \i after a change of basis) reach 
a certain (range of) value, where potential is no longer flat or the inflaton field suddenly decays to something else. 

However, the above simple picture is not necessarily the full story. In general, the reheating surface could be a 
hypersurface in field space, which is not perpendicular to the inflationary trajectory. The reheating surface may be 
even curved instead of flat. Such a generalized reheating surface can convert entropy perturbation into curvature 
perturbation, with a possibility for large non-Gaussianity. This situation is illustrated in Fig. [22} An explicit example 
of non-trivial reheating surface is known as multi-brid inflation |180) , and more general analysis is performed in [181 . 



As shown in the right panel of Fig. 22 the amount of isocurvature perturbation (in terms of field space distance) , 
converted to curvature perturbation, is characterized by the angle a, between the reheating surface and the inflation 
trajectory: 

5<f) = 5a cot a . (281) 

Note that ( = —H5<fi/(f). Thus in case the 5a fluctuation is conserved from Hubble crossing to reheating, then the 
adiabatic power spectrum induced by the entropy perturbation is 

AP c = cot*a^. (282) 

For standard inflaton sector, the total power spectrum is 

*=(l + «* 9 ")g^- (283) 

More generally, the a field could change its value from Hubble crossing to reheating. In this case the technique in the 
previous section can be employed to calculate the density perturbation. 

The non-Gaussianity of this scenario follows from how the reheating surface is curved |181j . When the inflaton 
trajectory hits a convex reheating surface, a positive /nl is created. 

This fact can be observed geometrically. For small isocurvature perturbations, the curved reheating surface can 



be approximated as a circle. Consider the left panel of Fig. 23 Here the green and red solid lines denote different 
inflation trajectories, with perturb ation — 5a and 5a respectively. Note that we have \BD\ > \AC\. 



Recall the observation of section 
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C = Cs + f /jViCo + " " " • Thus positive Jnl corresponds to an enhancement 
of positive £ g . Note that the red line (positive 5a) is positive ( g (because of greater 5N), and \BD\ > \AC\ means the 
second order effects enhance the 5N of the red line compared with the green. Thus when the trajectory hits a convex 
reheating surface, a positive fxL is produced. 



Proof: in the figure the line intervals satisfy \AO\ = \BO\ and \CO\ = \D 0\, where O is the center of the circle. From the construction 
we have angles /BOD > LAOC (this can be proved by rotating triangle AAOC by angle LAOS). Thus interval \BD\ > \AC\ 
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Note that if the inflation trajectory is perpendicular to the reheating surface, there is vanishing three-point function 
because of the symmetry of the configuration. However, once the trajectory hits the curved reheating surface with an 
angle, the sign of f NL only depends on how the reheating surface is curved. 

Similarly, hitting a concave surface produces a negative Jnl- A reheating surface without any curvature converts 
the isocurvature perturbation to curvature perturbation linearly and thus there is no non-Gaussianity. This situation 
is illustrated in the right panel of Fig. [23| 




FIG. 23: The curvature of reheating surface (or any other hypersurface on which entropy perturbation is converted into 
adiabatic perturbation) and the sign of non-Gaussianity. 

The relation between curved reheating surface and non-Gaussianity can also be generalized. Note that the curved 
reheating surface is nothing but a hypersurface to convert entropy perturbation to adiabatic perturbation. If during 
inflation, or after reheating, such conversion happens, the blue surface in Fig. [23] can be considered as the conversion 
surface from entropy to adiabatic perturbations. 

Further, it is still possible that the reheating surface (or other hypersurface with entropy to adiabatic conversion) 
is not curved, however the field trajectories themselves are diverging or converging when they hit the surface. This 
situation is illustrated in Fig. [24] 

Finally, it is interesting to generalize the above argument from a "hypersurface" of entropy to adiabatic conversion, 
into some continuous conversion mechanism (see, e.g. |182j ). Or consider originally non-Gaussian 5a fluctuations 
(thus they are no longer Q). 




fNL<0 f NL <0 



FIG. 24: A bundle of diverging trajectories corresponds to positive Jnl- This is because the positive part of £ 3 converts more 
and the negative part converts less. Similarly, a bundle of converging trajectories corresponds to negative fNL- 



4- Modulated reheating 



In this subsection we again play with the reheating surface of multi-field inflation. However, instead of the way to 
hit reheating surface, here we make the shape of the reheating surface dynamically depend on the entropy fluctuations. 
This scenario is known as the modulated reheating [1161 [183, 184 . This situation is illustrated in Fig. 25 
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FIG. 25: Modulated reheating. Note that despite the similarly looking of this figure with Fig. 22 here the "/" direction axis 



denotes spatial coordinate, instead of field value. The other axis has the same meaning (inflaton field value). 



Consider that the decay rate of inflaton depends on the entropy field: 

T = T{a) . 

During reheating, the inflaton energy density p^ decays into radiation p r . This process can be described by 

dp4> 



dN 



r 

H 1 



(284) 



(285) 



dp r r 

dN +4pr =H P * 



(286) 



3M 2 ff 2 = P4> + Pr 



(287) 



The above equations are to be solved with initial condition, at a time to before the inflaton decay, as p<^(to) = 3-Mri?Q 
and p r (to) = 0. The e-folding number during this process can be calculated as 



2 H f 



where Q satisfies 



exp 



4Q 



Expanding Eq. (288), one obtains 
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Q 
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e 4 » r p^N') 
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(288) 
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AP C = Q' 



, r 

Y 2^ 



(290) 



We again assumed extremely slow-rolling of a. Otherwise the dynamics of a should be tracked similarly to the 
curvaton scenario. The non-Gaussianities can be calculated by expanding Eq. (288) to higher orders |116j . 

For the sign of non-Gaussianities, the intuition in the last section (Fig. 25 1 still applies. Now we work in position 



space instead of field space. The distribution of a field is Gaussian. However, the conversion T{a) makes the reheating 
surface non-Gaussianly curved. The second derivative of T(a) determines the sign of non-Gaussianity. 
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5. Modulated expansion history 

In this section we discuss a mechanism similar to the modulated reheating, where the expansion history, instead of 
the decay rate, is modulated by an isocurvature direction j!85j 73 . 

For example, after inflation ends, the inflaton field decays to thermalized particles x, with mass m. The reheating 
temperature is Tj m. The \ particles are relatively long lived, such that it decays after the temperature of the 
universe drops to T F <C m. Thus from reheating to %-decay, the universe is initially radiation dominated and later 
matter dominated. After the x _ decay to lighter particles the universe becomes hot again. 

To track the local expansion history of this period, here we use a sudden transition approximation, neglecting the 
transition period between radiation domination and radiation radiation. This should be a reasonable approximation, 
because after m varies at different local universes, those different local universes undergoes the same transition (at 
different times). Thus the error cancels when we calculate the e- folding number difference SN. 

We use Te and He to denote the temperature and Hubble parameter at matter radiation equality time, for the 
X-particle dominated universe. The expansion from the initial time to the equality time is 

N IE = log(aj/a E ) = - log(HE/Hj) , (291) 
and the expansion from the equality time to the final time is 

N EF = \og{a F / ai ) = | \og{H F /H E ) . (292) 
The total e- folds is a summation of the two stages: 

N IF = -i log Hi-\ log H E + I log H F . (293) 
2 b 6 

To relate the expansion to the ^iV-formalism, we have to choose the initial slice to be the flat slice and the final 
slice to be the uniform energy density slice. Assuming the original density fluctuation of the \ particles (originates 
from inflaton fluctuation) is negligible. In this case the uniform Hj slice is indeed the initial flat slice. On the other 
hand, the final uniform H F slice is the uniform energy density slice because H F oc y/p~F from the Friedmann equation. 
Thus when we vary Njp, Hj and H F should be fixed. The only variation comes from H F - The mass, energy density 
and Hubble parameter can be related as 74 

Pe ^ 2~q9 T e x mi > H E oc yfp^ oc m 2 . (294) 

Here we still approximately used the energy density of radiation dominated epoch. The difference results in an error 
of the sudden transition approximation. 

Thus the comoving curvature perturbation can be calculated as 

t XAT 1SHe l6m COOK"* 

C = (W/F = -6l^ = -3^- (295) 

Now we need to calculate 5m in the JA^-formalism. As a simplest case: we may directly have m = m(a), where a 
is an isocurvature field which gets fluctuations during inflation. In this case, 5m = m'(a)5a. If a is rolling extremely 
slowly during inflation, such that the Hubble-exit value is the same as the value at equality time, the induced power 
spectrum for curvature perturbation is 

1 fm'\ 2 „ Hi frri 



P < = 9{m) P -=36^UJ ' (296) 
where H„ is the horizon crossing value of Hubble parameter. Similarly, the non-Gaussian estimators can be calculated. 



Note that the same model is also calculated in [186] . using an integral constraint in GR. The result is in conflict with the result from 
SN formalism. Nevertheless, here we focus on the method in |185| . reformulated following closely the principles of the SN formalism. 
There is an additional factor of 7/8 in the energy density for fermions, which do not affect the final result. 
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On the other hand, if a is rolling not so slowly, but follow linear evolution (i.e. the potential is just a mass term), 
8a I a is a constant on super-Hubble scales. In this case, when m oc cr", we have m'/m — aSa/a and 

a 2 H 2 

P < = 36# • < 297 > 

Finally, if the a potential is dominated by terms other than mass during some periods of time evolution, one can 
define a function g such that a = g {<?*), and calculate g perturbatively, as is done in the curvaton mechanism. 

As the last step, x has to decay, preferably to radiation components and re-reheat the universe. Note that the 
decay rate of x a ls° depends on m. There are at least two possibilities: 

No modulation in re-reheating: It is possible that the a rolls back to origin, or decays, before the decay of x- I n 
this case, at x decay, the mass m no longer depends on a. Thus the re-reheating at x decay does not introduce extra 
curvature perturbations. 

Modulated re-reheating: In this case, the discussion for the extra curvature perturbation transfer is in parallel to 
the modulated reheating. 

As alternative realizations, one may either modulate the expansion rate at other stages of the universe. For example, 
one can modulate the dilution rate of isocurvature direction, to generate entropy perturbation. As another example, 
during the oscillation of the inflaton, before reheating, the shape of the potential may get modulated. For example, 
near the end of inflation, the mass term of inflaton may get an anomalous dimension 75 : 

V = ^TiVexp T lo S - ' ( 298 ) 
2 [Am 

As a result, the dilution of inflaton no longer has equation of state p/p = 0, but instead 

P - _^ (299) 

p ~ a + 4A " 4A ' 1 ' 

Thus the expansion history is modified as a function of the a value in local universes. Assuming a <C A, 

( = --e 3wAN ~-^-AN , (300) 
s 3 4A v ; 

where AiV is the number of e-folds of inflaton oscillation before it decays. The inflationary power spectrum follows 
from squaring this equation. 



6. A landscape of inflatons and isocurvatons? 

In the previous subsections, we have been making the multi-field inflationary potential complicated. Inspired by the 
string landscape |187H190] . it is argued that inflation may take place on a rather complicated potential [166. H9lTll92) . 
Yet there is a question: how complicated the inflationary potential could be? 

To see the physics more closely, we decomposed the question into a few more concrete ones: 

• How complicated could a potential be on one field direction? Note that slow roll of inflationary direction already 
tightly constraint the potential along the inflation direction. We postpone the discussion of possible break down 
of slow roll to Subsection IIVDI 

• How many fields could there be during inflation? The constraint |1931 1194) is quite weak. Nevertheless a weak 
constraint is stronger than no constraint |195j . 

Note that during inflation, each massless (actually m < H should be enough) field has a fluctuation cr,; ~ H per 
Hubble volume. Thus the total energy density of n massless fields is 

n 

Pn ~ ^(dcr,) 2 ~ nH* . (301) 



75 We assume other terms drives inflation, and this potential only applies near the end of inflation. Otherwise serve fine tuning is needed 
to make a massless. 
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FIG. 26: Once inflation takes place on a complicated multi-field potential, the inflationary trajectory (white lines in the figure) 
may bifurcate. The bifurcation, if happened during observable state, and not under control, brings deserters. 



This must be much smaller than the background energy density p. Thus we have 

(302) 

p 

This constraint can also be argued as that, near Hubble crossing, the quantum perturbation of each massless 
field becomes classical and carry of order one bit entropy (about its phase). However, de Sitter space has an 



entropy bound of H 2 /M 2 . Thus the field number is bounded by (302). 

Note that this number is greater than 10 10 . Thus most models are unconstrained, unless one wants the number 
of fields to be greater than 1/P( or 1/Ph- 

What extras could a multi-field complicated potential bring compared with a single field one? 

As we already discussed, in multi-field inflation, much more things could happen compared to the case of single 
field inflation. Here regarding to a complicated potential, there is at least one more: The classical trajectory of 
inflation may bifurcate during inflation [96 . Inflation with such a bifurcation is called multi-stream inflation, 



as illustrated in Fig. 26 



In case the bifurcation of inflationary trajectory is under control (with symmetry or fine-tuning), those bi- 
furcations can be responsible to non-Gaussianities or position space features such as asymmetries and/or 



cold spots on the CMB [196) . Or alternatively the bifurcation could happen before the observable stage of 
inflation [197 , 198J. However, if the bifurcation is due to a random potential, the e-folding number difference 
between different trajectories are not under control. In this case the CMB sky becomes extremely anisotropic 
and non-Gaussian. This kind of models should be ruled out. 

It is shown that in a double-field random potential, bifurcation could happen with some not-so-fine tuning of 
potential parameters |97j . When the random potential has more dimensions, the probability for bifurcation 
should increase thus there may be a tighter constraint. 

There is also a danger of rapid tunneling in a random potential, as discussed in 199, 1200j . 
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C. Quasi-single field inflation 

1. Energy scales during inflation 

We have discussed single field inflation and multi-field inflation. Definitely we cannot cover even a small portion of 
the huge literature. Nevertheless at first look the classification of single/multiple field covers all possibilities. 

However, it is actually not the case. Note that inflation is a theory with multiple energy scales built in. The 
inflationary background defines an energy scale Eb = ^JfjvH , whereas the perturbations defines another energy scale 
Ep = H . There is a hierarchy between those two energy scales. 

The hierarchy of energy scales allows a class of inflation models, in which at the homogeneous and isotropic 
background level, the models look like single field inflation. However, when we consider cosmic perturbations, which 
live at a higher energy scale, the physics becomes different from single field. This class of models is called quasi-single 
field inflation [HE] [H] [US] . 



For example, consider a scalar field a with mass m ~ H during inflation (Fig. 27). At the background level, this 
field stays at its minimum of the potential, because the mass is higher than the background energy scale and not slow 
rolling. Thus at the background level we have single field inflation. 

However, at the energy scale of perturbations, m ~ H is not too heavy. It takes quite a few e-folds for the a 
perturbation to roll back to the origin. As a result, there is room for the a perturbation, which appears as entropy 
perturbation, to convert to curvature perturbation. 



M«H : multi-field M~H : quasi-single field M»H : single field (?) M-iso 

FIG. 27: Energy scales during inflation. 

Note that the potential of the a field is not restricted by slow roll condition. Thus the potential need not to be very 
flat. As a result, the self interactions of a can be strong. When the entropy perturbations in the a direction converts 
to curvature perturbation, the non-Gaussianity of a, from the self interactions, are also converted. The generation 
and conversion of non-Gaussianity is near Hubble crossing. This is different from the local non-Gaussianity (which 
is generated on super-Hubble scales) or equilateral non-Gaussianity (which is generated on sub-Hubble scales). As a 
result, a family of new shapes of non-Gaussianity arises, controlled by the mass m of a. 



V. 



2. Mass uplifting of flat directions 



As we discussed in Sec. IG there is an ^y-problem of inflation: Originally massless fields (unless protected by 
symmetry) get uplifted to m ~ H during inflation. Note that this is true not only for the inflaton field, but also for 
other originally massless fields. 

If we have some originally massless fields in field theory. Now inflationary back-reaction typically uplift them to 
m ~ H . Some of them may have m <C H by pure chance 76 or some fine tuning. They can behave as the inflaton, 
because we want inflation. The other fields has m ~ H. 

Among those m ~ H fields, some may decouple from inflaton. The fluctuation of those fields are diluted after a 
few e-folds of super-Hubble evolution and thus not observable. However, others may couple to inflaton, via operators 

[Hi] 

<j> 2 a, {d^fa , ad^a , • • • . (303) 



The chance may not be small if there are many m ~ H fields. For example, assuming the mass of the field obeys Gaussian distribution 
exp[— c 2 (m — H) 2 /H 2 ], one needs \/2c/[erf(c) — erf(0.9c)] fields to naturally have one or more of them lying in range < m < 0.1H. 
For c = 0.5, 16 fields are needed; for c = 1, 31 fields are needed; for c = 5, 10 10 fields are needed. 
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3. A simple model 



It is helpful to consider a simple model to see explicitly how the above argument works [20 lj [98|. To do that, we 
consider a model in which the inflation trajectory is along an arc. The angular direction is the inflaton and the radius 
direction is the massive field a. We illustrate the case in Fig. [28] 




FIG. 28: A simple example of quasi-single field inflation. The isocurvature direction is not slow roll restricted. Thus the 
entropy fluctuation can be non-Gaussian. During the turning of trajectory, the non-Gaussian entropy fluctuation is converted 
to the adiabatic perturbation, along the inflation direction. 



For simplicity, we further assume the potential for the two scalars can be written in the factorized form when 
written in the polar coordinates. In this case, we can write down the theory in polar coordinates as 



S m = I d x^g 



+ of g^d^e - ^ v d^d v a - V SI (9) - V(a) 



(304) 



where V ST (9) is the slow roll potential in the inflation direction; V(a) is the potential in the V" = m ~ H direction, 
without slow roll restriction; and 1Z is the radius of the turning potential. Gravity minimally couples to this matter 
sector. The background equations of motion are 



3M 2 iJ 2 = ]-TZ 2 9 2 + V + V SI 



-2M 2 H = H 2 ° 2 



o • 



(305) 



where TZ is the radius of turning trajectory, which has a shift by a constant compared with TZ due to the centrifugal 
force: a n = R — R, such that V'(a ) — R9q. 

As we have shown in the general discussion of adiabatic and entropy perturbations, the turning of trajectory couples 
the adiabatic and entropy directions. This can be verified in the present model by expanding (304 1 in perturbations. 
The interaction is 



AC 2 = 2a 3 9 x (7156) x 5a 



(306) 



The importance of this term can be estimated as follows: near Hubble-crossing, the canonically normalized field 1Z59 
has fluctuation of order 1Z59 ~ H 2 . The da field is massive thus has amplitude 8a ~ mm(H,H 2 /m). The mass 
dependence is because when m > H, the 8a fluctuation is suppressed [591 202 by (8a 2 ) ~ 1/m 2 , which is the flat 
space estimation of propagator. 

• 2 

To estimate how strong is the coupling A£ 2 , we can compare it with the free Lagrangian £ 2 = 1Z 2 89 H ~ H A . 

We have 



.92 



A£ 2 



9/H, 
= { 9/m, 



if m ~ H 
if m > H 
cutoff needed if m <C H 



(307) 



Because of the transfer vertex, the power spectrum gets a relative correction proportional to (<7 2 ) 2 - This situation 
is illustrated in Fig. 29 
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As a result, the power spectrum goes as 

// 

ff2 



P c = - (1 + 5) , (308) 



where <5 is the relative correction due to the transfer vertex. The analytical expression of 8 is given as 8 = 89 {C\ 



C2)/H 2 in The g2 estimate in (307) should apply for both m ~ H and m^> H. Thus in those cases, up to 0(1) 
factors, 

5 ~ <?f ■ (309) 

On the other hand, when m <C ff, there is a divergence that worth noticing. It can be shown that in this limit, 
if one take the bounds in the in-in integral as — oo < r < 0, 8 ends up with a divergence in the small m limit: 
8 ~ 72g|i7 4 /(2m 4 ). However this divergence is not real because inflation will eventually end. If we instead take 
m = and cut the in-in integral as — oo < r < r c , we have 8 ~ log 2 r c . Thus in the m <C H limit, we have 

8 ~ max [ — j-, log 2 r c ] ~ max [ —r, 360 ] , (310) 
\ m 4 / V m 4 / 



where we have assumed 60 e-folds of observable inflation. It turns out that when m ~ 0.3-ff (which is actually not 
much smaller than H), we will already have to consider the e- folding number limit of inflation. 

The IR growth of m <C H case is straightforward to understand: for a massless field, when there is a fluctuation, 
the fluctuation will continues interacting with the adiabatic perturbation all the time until the interaction is turned 
off (for example, reheating). In flat space QFT, the cross sections and decay rates arc defined as a finite value per- 
unit time. Thus there is no wonder the interaction rate diverges with a diverge duration of time. 
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FIG. 29: The transfer vertex in quasi-single field inflation, and the correction to the power spectrum, due to the transfer 
vertex. 



4- Amplitude of non-Gaussianities 



The amplitude of non-Gaussianity can be estimated similarly to the power spectrum for quasi-single field inflation. 
The bispectrum can be estimated by the coupling 



A£ 3 = V"'8a 3 



(311) 



Note that V'" has mass dimension. Thus it is natural to have V" ~ H during inflation, similar to the m ~ H 
argument. And again similarly to the Tyy-problem, the V" term may have UV sensitivity driving V'" 3> H straightly 
to the UV-sensitivity scale. One can estimate 



AC 
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{V"'/H, if to ~ H 

V"'H 2 /m 3 , if to > H 
cutoff needed if m <C H 



(312) 



Here we shall focus on the weakly coupling case 53 -C 1, leaving the more general case to Sec. IV C 6 

We should note that this interaction g% is huge, compared with the inflaton counterpart, which is offered by a slow 
roll potential V SI . As shown in Eq. (176), the slow roll potential is suppressed by 



V'l 



HO{e 2 ) 



(313) 
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And we know this V^.' generates sub-dominate non-Gaussianity for single field inflation with amplitude f^L ~ C(e 2 )- 
Thus we immediately know that the /™^ in the entropy perturbations should be ~ (73/ y/P;, which could be huge 
considering l/^/P^ ~ 10 4 . 

Here we obtain the l/^/Pf factor by comparing with the single field potential. The same estimation could either 
be obtained as follows: The three point function is (CCO ~ P^ 2 x (coupling constant). On the other hand, the 
bispectrum estimator /jvx is multiplied with P^ in its definition. To compare them we need to divide the result by 

Now we have got the amplitude of non-Gaussianity in the isocurvature direction. However, this entropy perturbation 
needs to be converted to adiabatic perturbation. We have shown the conversion is mediated by 52 coupling. As a 
result, the amplitude of non-Gaussianity for quasi-single field inflation can be estimated as 



Jnl ~ ^ ■ (314) 



'Pi 



This amplitude is illustrated in Fig. [30} 
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FIG. 30: The interaction vertex and the three point correlation function. 



Similarly, the analogue of g^L and tnl can also be estimated. As we have shown in Sec. IIIF2 tnl is dominated 
by the diagram with an internal propagator, and g^L is dominated by the other diagram. Thus we have 

9 nl ~ gi(92f/Pc , r NL ~ (g 3 ) 2 (g 2 ) 4 /Pc ■ (315) 
When g2 is small (which is the case we can trust our calculation), we have 

TNL » f NL ■ (316) 

Thus the Suyama-Yamaguchi inequality is satisfied. The graphical understanding is given in Fig. |32| One can also 
estimate the higher order correlation functions |117j . The t^l analogue of the n-point function scales as 

f ( NL~(9 3 /VPc) n ~ 2 (92) n , (317) 

where we have factorized the product into two groups, one scales with power n — 2 and another scales with n. Those 
two groups follows different interaction structures: 

• The power n — 2 group shows how the non-Gaussianity is generated. This is because for three point interaction 
(which generates Jnl, t nl,' ■ ■ , and 9nl belongs to yet another case), we need n — 2 vertices to have n-point 
function. 

• The power n group shows how the non-Gaussianity is converted from entropy direction to curvature direction. 
This is because for n-point function, we need at least n transfer vertices to convert all external legs to adiabatic 
perturbation. 

This scaling behavior is discussed in more details in |203j . known as the feeding mechanism. 



5. Shape of non-Gaussianities 



Now comes the shape of non-Gaussianity. To get some intuition, we may consider two limits: 
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FIG. 31: The quasi-local analog of g^L (left panel) and tml (right panel). 




FIG. 32: The Suyama-Yamaguchi relation. Here we have tnl 3> (36/25),fjv L , as long as the transfer vertex is perturbative. 
In case 52 > 1) our perturbative calculation cannot be trusted. However, the Suyama-Yamaguchi relation should still hold. 
The implication is that, resummation should suppress the large coupling. 



• When m <C H, quasi-single field inflation approaches multi-field inflation and the shape should approach local 
shape. This is because when there is an entropy fluctuation, in the massless limit the fluctuation do not roll 
back. Thus the interaction continues on super-Hubble scales. From causality, the non-Gaussianity should be 
dominated by the local shape. 

• When m ^> H, the kinetic term of a becomes hard to excite. Thus the heavy field just stays at the effective 

minimal of the potential, determined by the centrifugal force V'(a) = R9 . To perturb this equation, additional 
. 2 

factors of 69 comes into the action of 59, which modifies the sound speed c s of 69 perturbations. As a result, 
the non-Gaussianity should be very non-local. 

In general, we expect the shape should be in between the above cases. In the squeezed limit k\ -C k% — ^3, the k- 
dependence of the three-point function can be calculated explicitly [98] ■ Here we will not go into the details. Instead, 
as observed in [155] . the squeezed limit of quasi-single field inflation can be estimated as follows: The long wave length 
mode crosses the horizon at |fciTi| ~ 1. Later, the short wave length mode crosses the horizon at Ife^l ~ 1- Between 
time n and T2, the massive field a has a decaying solution: 

/ \ 3/2-1/ v 3/2-1/ 

cTfefaWkifa) (yj ~ ak ^{]r 2 ) . ( 318 ) 

where v = \/9/4 — m 2 /H 2 . 

In the squeezed limit, k\ <§; k?,. At T2, the k\ mode can still be considered as a shift of background. But the shift 



has a decay factor (318). As a result 



j£j^ CM " fepf (|) K fcp^ ■ (319) 

This analysis can also be made more rigorous [121] . Also, in the special case v = 0, a logarithm factor appears in the 
squeezed limit [9"8] . 

The exact shape of three-point function is quite complicated and plotted in [98j . Phenomenologically, the dimen- 
sionless shape function can be approximated by 

3§ ~ 3u (k 2 + k 2 2 + fc 3 2 )(fc 1 fc 2 A:3)5- l/ (fci + k 2 + k 3 )-i+ 3 » . (320) 
This shape function is simple, but unfortunately not factorized. One can nevertheless write a factorized shape near 



the squeezed limit [204] . The shape function (320) is plotted in Fig. 33 

It is interesting to notice that, the shape function for m — H/2 already looks very similar to the local shape. Indeed, 
a Fisher matrix analysis shows that the m < H/2 shape and the local shape are not distinguishable in the next 10 
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years' experiments |204j (see also |205j ). In case the local shape non-Gaussianity fits future data, it is challenging 
to distinguish if the non-Gaussianity comes from a massless held, or from a m < H/2 held and the corresponding 
quasi-local shapes. On the other hand, shapes from greater mass (such as m = H) will be distinguishable from local 
shape. 



quasi-single field (m = 3H/2) quasi-single field (m = 1.414_ff) 




quasi-single field (m = H) quasi-single field (m = H/2) 




FIG. 33: The shapes of non-Gaussianity for quasi single field inflation, for m = 3H/2 (i.e. v = 0), m = 1A14H (i.e. v = 0.5), 
m — H (i.e. v — 1.12) and m — H/2 (i.e. v = 1.414) respectively. The shape ansatz ( |320[ ) is used. Note that the change of 
shapes are very sensitive to the a mass in Hubble unit. 



6. Scaling dimensions for general operators 



In the above discussion of quasi-single held inflation, we have restricted our attention to the weakly coupled case. 
In the strongly coupled case, perturbative methods of directly calculation breaks down. However, one can still try 
to resum the diagrams, seeking a weakly coupled effective field theory. For example, the transfer vertex can be in 
principle resummed using the Schwinger-Dyson equation, as illustrated in Fig. [34] 

Alternatively, one can generalize the weakly coupled picture without doing the resummation. Especially for a 
conformal held theory (CFT) the interaction structure is highly constrained by symmetry (101| . In CFT, mass and 
interaction operators are characterized by their scaling dimensions. In this case, the squeezed limit of correlation 
functions are as functions of the scaling dimensions of the operators. 



The scaling dimension of the mass operator is an analogue of the example of Sec. IV C 3 However, a non-trivial 



scaling dimension of interaction operators open up new possibilities. For example, one can imagine a sector of 
isocurvature perturbations which is strongly coupled on sub-Hubble scales, but weakly coupled on super-Hubblc 
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FIG. 34: Resumming the transfer vertex. Note that the above diagrams represents a set of integral equations. When V'" 
and/or other interactions are turned on, the resummation becomes more complicated but yet possible. 

scales. The transition is controlled by the RG-fiow and the anomalous dimension of the interaction operator. In 
this case the anomalous dimension dynamically generates a scale, analogous to Aqcd- One can compare this scale 
to the Hubble scale, and generate a one parameter family of non-Gaussianities. New possibilities such as the nearly 
orthogonal shape arise compared to the simple model that we reviewed in detail. 



D. Scale dependent features 



So far the inflation models we have reviewed are nearly scale invariant. Such models are natural because the observed 
power spectrum is nearly scale invariant. However, it is still possible that there are small ripples in the inflationary 
power spectrum, that we have not yet observed. Or the observational effects hide in higher order correlation functions 
or gravitational waves, which we haven't yet observed. Here we review a few models of this kind. 



1. Features in the potential 



Consider single field inflation, where the potential is approximately flat but there are small features on top of that. 
For example, one can model a feature as [206] 



V{<t>) = \m 2 <f 



I + c tanh 



(321) 



The potential is illustrated in Fig. [35] 

Alternatively, one can choose a feature with a non-continuous slope 207 : 



V a + A + ( 
V + A< 



■-<f>o), if > 0o 
■ -<£o), if < 0o 



(322) 



A sharp feature can in general enhance slow roll parameters. The enhanced slow roll parameters result in a feature 
in the power spectrum, as well as large non-Gaussianity. However, it is useful to estimate which slow roll parameter 
is enhanced most. Considering the sharpness of the feature, higher time derivatives get enhanced most. 

In the third order action of standard single field inflation (174 1, the highest derivative is fj. Thus the interaction is 
dominated by 



n 3 d 



a 3 e 



(323) 



The shape of the bispectrum can be calculated numerically. It is intuitive to see that the non-Gaussianity is now scale 
dependent. This is because the far sub-Hubble modes has time to relax back to the vacuum and the far super- Hubble 
modes are conserved already. The most-affected modes are the Hubble crossing ones. In the bi-spectrum, they are 
the ones |(fei + + k^r^l ~ I, where r* is the time when the inflaton rolls onto the feature. As a result 205), the 
shape function is modulated by an oscillation sin[(fci + ki + ka)T*]. 
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FIG. 35: A feature in the potential. The solid line denote the actual feature considered in [206], with c = 0.0018 and d = 0.022. 
The feature is actually too small to be observable on the plot. The dot line is just an illustration of the feature, choosing by 
hand a huge feature (c = 0.1), which is clearly ruled out by experiments. 



2. Sharp turning of trajectory 
Here we review a sharp turning of inflationary trajectory 1209H217] . For example, there is one inflation direction, 



and another very massive direction. The turning of trajectory is between those two directions, as illustrated in Fig. 36 



A 





e 





2 sin 9, where 6 is the turning angle. This amount 
(<p/m) sm8. In case this offset in the a direction 



FIG. 36: A sharp turning of trajectory. 

During a sharp turning, a combination of effects may happen: 

• Background oscillation: When the trajectory turns very sharply, the kinetic energy of the infiaton gets projected 
on to the massive direction. The amount of transfer energy is | 
of energy drives the massive direction a to move as high as a <■ 
is not cancelled by a linear potential, the a held get released from this height and start to oscillate. 

The background oscillation is not just a shift of background. Instead more dramatic things can happen. Note 
that the perturbations has varying physical wave length k/a. Once the k/a for a perturbation mode gets close 
to the oscillation frequency of the background, parametric resonance can happen. This situation is similar to 
the case of preheating. As a result, both the linear perturbation and non-Gaussianity are enhanced. 

It is interesting to note that, both the background oscillation itself, and the induced effects such as the resonant 
enhancement of perturbations, are periodic (with a decaying envelope) with respect to physical time. On the 
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other hand, the comoving wave number k is defined with respect to the conformal time. Thus in case one 
observes such a periodic feature in the sky, the function dependence between the physical time and conformal 
time is obtained. This function dependence differs significantly between inflation and alternative to inflation 
models [2TT1I2T5] . 

In case the turning of trajectory is not sharp enough, or a linear potential comes to cancel the centrifugal force 
during the turning of trajectory, the background oscillation may not be present. 

• Sharp feature: During the turning of trajectory, the slow roll parameters got boosted thus there is an similar 
effect as discussed in Sec. IIVDI 

• Vacuum projection: The wave function of a quantum harmonic oscillator depends on the frequency of the 
oscillator. The same is true for scalar fields. The wave function of a massive direction is different from the 
wave function of a massless direction. However, after a sharp turning of trajectory, the two wave functions gets 
mixed. As a result, the perturbations of the fields are no longer in the vacuum state. 

In terms of A;-modes, the perturbation modes with H < k/a < m are affected. The mixing, which can be 
calculated using the in-in formalism, kicks the perturbations even if there is no background oscillation. 



3. Particle production 

Despite the wording "slow roll", the inflaton field actually runs very fast, with <j) ^> H 2 . Thus it is useful to check 
if the fast motion of the inflaton pollutes the inflationary environment. For example, when other fields have coupling 
to the inflaton, there may be particle productions induced by the motion of the inflaton |219l f222j. 

For example, consider an otherwise massless field x, with the interaction Lagrangian 

A„t = -y (0 - 0o)V ^ -\ml s {4>) X 2 ■ (324) 
The motion of the inflaton provides an effective mass for x- The equation of motion for \ is 

d 2 

X + 3H X +^X + m 2 eS (4>)x = . (325) 
or 

When the change of background is slow, the x field has a WKB solution. As a result x almost stays unchanged 
without particle production. However, when cf> moves cross 4>o, the situation changes dramatically. Not only that x 
gets effectively massless, but also the WKB condition breaks badly: 

m oft g{<p-<l>o) 

Thus particles of x get produced. To see the effect more closely, one can parameterize the behavior of (j) near (j) as 
(j) ~ 0o + vt. Solving the EoM of x, the occupation number can be calculated as 

n k = exp (-^-r) ■ (327) 
V 9\v\J 

After the moment <p = <pQ, the produced particles quickly become massive. The mass increases while the number 
density is diluted. The back-reaction from the produced particles to the adiabatic perturbation can be calculated. 
The power spectrum gets a correction. For the bispectrum, for small k the shape of non-Gaussianity is similar to 
local shape. For larger k there are oscillations in the shape function |221j . 

It is also interesting to put this scenario onto a Landscape type potential. When the number of fields increase, it 
is exponentially likely for the inflation field to encounter particle production events (with a small mass of x) [222 1. 
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Appendix A: Computer Assisted Analytical Calculation 

Although the physics of inflationary perturbations is interesting, the calculations are sometimes tediously long. 
Fortunately, nowadays computers are in the position to assist us for this purpose. 

1. Algebra systems 

There are quite a few choices of algebra systems in the market. Among those, there are propriety software such as 
Mathematica and Maple, and free software such as Sage and Maxima. Here we comment the pros and cons for our 
particular purpose (there are of course other choices as well) : 

• Mathematica: The functional programming paradigm is typically convenient for math calculations. Also there 
is a powerful mechanism of pattern matching, related to its functional programming nature. There are also GR 
packages such as XACT available on Mathematica. 

• Maple: There is a very powerful package on general relativity: GRTensor, on the Maple platform. 

• Sage: Sage is a collection of free math software. Those lower level software use different languages and conven- 
tions. And sage unifies them with an interface using the Python programming language. Python is strong in 
object-oriented programming, in contrast with the functional programming approach taken by Mathematica. 

• Maxima: Currently Sage use Maxima as back-end. In the analytical aspects, Maxima itself is feature complete, 
if one would not need the conveniences provided by Python. 

The examples of calculating cosmic perturbations using Mathematica, Maple and Sage are given at 
https : //dl . dropbox . com/u/49916494/Yi-Wang-pert-sample-code . tar . gz 

Not every part of the code is thoroughly checked thus please double check before usage. From now on we take 
Mathematica as an example for discussion. 

2. Summed indices 

One annoying fact is that computers are usually not clever enough to combine quantities in desired forms. Especially 
they typically do not collect summed indices. Algebraic systems as Mathematica will leave you something like 
d\d\f + 8282/ + 8383/ instead of didif. There are at least two tricks one can do for this: 

• Replace the dummy indices: We can define a replacement rule 

expression /. { D[f_,xl,xl] -> p2[f] - D[f,x2,x2] - D[f,x3,x3] } 

Note the underscore here. This underscore matches and replaces didi acting on all variables, not only f itself. 
We will also meet a lot of other combinations that we need to design rules to match and replace. 

• Keep them abstract from the beginning: For most of the cases, we don't need to actually calculate the summation 
of spatial indices, we just need to write them in a compact form. For this purpose, we can directly write didif 
in the code. It is (typically) elegant and not hard to do this. We have to note: 

— The default derivative operator tries to calculate dif explicitly. This is not the desired behavior when 
we decide to keep didif as abstract. To solve this problem, we can define derivative operators ourselves. 
The definition should include linearity and the Leibniz's rule for multiplication, without defining how to 
calculate dif explicitly. It is also convenient to use the "Notation" package to overload the d operator. 
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— Dummy indices may conflict. For example, how does Mathematica write (<9 2 /) 2 as didifdjdjf instead of 
as didifdidifl For this purpose, the Unique[] function is useful. We can write 

(\partial_# \partial_# f )& [Unique [] ] * (\partial_# \partial_# f )& [Unique [] ] 



Then Mathematica automatically assign different dummies for different terms. It is also not hard to write 
a function to simplify dummies. 



3. Integration by parts 

In the code calculating the action of perturbations, one frequently need to integrate by parts. The situations can 
be divided into 2nd order or higher order actions, and spatial or temporal directions. For 2nd order spatial direction, 
one can go to momentum space to get rid of integration by parts. However, for higher order, or temporal direction, 
going to momentum space does not help that much. 

One general way to do integration by parts is to use the Mathematica function ReplaceList [] . This function tries 
all possible replace rules (which we write ourselves). We just need to setup some measure to compare the result of 
each replacement and find a good one. We can repeatedly call ReplaceList [] (or multiple ReplaceList [] in a row) until 
the result is simplest. 
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